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Abstract 
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the second (section 3), I discuss three special topics: (i) the status of the 
relative simultaneity relation in the context of Minkowski spacetime; (ii) 
the "geometrized" version of Newtonian gravitation theory (also known as 
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1 Introduction 



The essay that follows is divided into two parts. In the first, I give a brief 
account of the structure of classical relativity theory. 1 In the second, I discuss 
three special topics. 

My account in the first part (section 2) is limited in several respects. I do not 
discuss the historical development of classical relativity theory, nor the evidence 
we have for it. I do not treat "special relativity" as a theory in its own right 
that is superseded by "general relativity" . And I do not describe known exact 
solutions to Einstein's equation. (This list could be continued at great length. 2 ) 
Instead, I limit myself to a few fundamental ideas, and present them as clearly 
and precisely as I can. The account presupposes a good understanding of basic 
differential geometry, and at least passing acquaintance with relativity theory 
itself. 3 

In section 3, I first consider the status of the relative simultaneity relation in 
the context of Minkowski spacetime. At issue is whether the standard relation, 
the one picked out by Einstein's "definition" of simultaneity, is conventional in 
character, or is rather in some significant sense forced on us. Then I describe the 
"geometrized" version of Newtonian gravitation theory (also known as Newton- 
Cartan theory). It is included here because it helps to clarify what is and is not 
distinctive about classical relativity theory. Finally, I consider to what extent 
the global geometric structure of spacetime can be recovered from its "causal 
structure" . 4 

1 I speak of "classical" relativity theory because considerations involving quantum mechan- 
ics will play no role. In particular, there will be no discussion of quantum field theory in 
curved spacetime, or of attempts to formulate a quantum theory of gravitation. (For the 
latter, see Rovelli (this volume, chapter 12).) 

2 Two important topics that I do not consider figure centrally in other contributions to this 
volume, namely the initial value formulation of relativity theory (Earman, chapter 15), and 
the Hamiltonian formulation of relativity theory (Belot, chapter 2). 

3 A review of the needed differential geometry (and "abstract-index notation" that I use) 
can be found, for example, in Wald 119841 and Malament unpublished], (Some topics are also 
reviewed in sections 3.1 and 3.2 of Butterfield (this volume, chapter 1).) In preparing part 1, 
I have drawn heavily on a number of sources. At the top of the list are Geroch |unpublished| , 
Hawking and Ellis IT5751 . O'Neill p83l . Sachs and Wu fl977alll977bl and Wald p84l . 

4 Further discussion of the foundations of classical relativity theory, from a slightly different 
point of view, can be found in Rovelli (this volume, chapter 12). 
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2 The Structure of Relativity Theory 



2.1 Relativistic Spacetimes 

Relativity theory determines a class of geometric models for the spacetime struc- 
ture of our universe (and subregions thereof such as, for example, our solar sys- 
tem). Each represents a possible world (or world- region) compatible with the 
constraints of the theory. It is convenient to describe these models in stages. 
We start by characterizing a broad class of "relativistic spacetimes", and dis- 
cussing their interpretation. Later we introduce further restrictions involving 
global spacetime structure and Einstein's equation. 

We take a relativistic spacetime to be a pair (M,g a f,), where M is a smooth, 
connected, four-dimensional manifold, and g a b is a smooth, semi-Riemannian 
metric on M of Lorentz signature (1, 3). 5 

We interpret M as the manifold of point "events" in the world. 6 The in- 
terpretation of g a b is given by a network of interconnected physical principles. 
We list three in this section that are relatively simple in character because they 
make reference only to point particles and light rays. (These objects alone suf- 
fice to determine the metric, at least up to a constant.) In the next section, we 
list a fourth that concerns the behavior of (ideal) clocks. Still other principles 
involving generic matter fields will come up later. 

We begin by reviewing a few definitions. In what follows, let (M,g a b) be a 
fixed relativistic spacetime, and let V a be the derivative operator on M deter- 
mined by g a b, i.e., the unique (torsion-free) derivative operator on M satisfying 

5 The stated signature condition is equivalent to the requirement that, at every point p in 

1 4 

M , the tangent space M p has a basis £ a , £ a such that, for all i and j in {1, 2, 3, 4}, 



+1 if i = 1 

-1 if i = 2, 3, 4 



and § a § a = if i ^ j. (Here we are using the abstract-index notation, 'a' is an abstract index, 
while V and 'j' are normal counting indices.) It follows that given any vectors rj a = X]f=i ^ f > 
and p a = £* =1 I l a at p, 

11 22 33 44 

rf p a = kl—kl—kl—kl 

1 1 2 2 3 3 4 4 

ifrja = k k — k k — k k — k k. 

6 We use 'event' as a neutral term here and intend no special significance. Some might 
prefer to speak of "equivalence classes of coincident point events" , or "point event locations" , 
or something along those lines. 
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the compatibility condition V a gb c = 0. 

Given a point p in M, and a vector g a in the tangent space M p at p, we say 
rf is: 

timelike if ?7 a ?7a > 

mtZZ (or lightlike) if ?7 a ?7a = 
causal if ?7 a ?7 a > 

spacelike if ?7 a ?7 a < 0. 

In this way, g a b determines a "null-cone structure" in the tangent space at every 
point of M. Null vectors form the boundary of the cone. Timelike vectors form 
its interior. Spacclikc vectors fall outside the cone. Causal vectors are those that 
are either timclike or null. This classification extends naturally to curves. We 
take these to be smooth maps of the form 7: I — ► M where I C R is a (possibly 
infinite, not necessarily open) interval. 7 7 qualifies as timelike (respectively null, 
causal, spacelike) if its tangent vector field 7 is of this character at every point. 

A curve 72: 7 2 — ► M is called an (orientation preserving) reparametrization 
of the curve 71: I\ — > M if there is a smooth map r: I2 — > 7i of 72 onto 7i, with 
positive derivative everywhere, such that 72 = (71 o r). The property of being 
timelike, null, etc. is preserved under reparametrization. 8 So there is a clear 
sense in which our classification also extends to images of curves. 9 

A curve 7: I — > M is said to be a geodesic (with respect to g a b) if its tangent 
field £ a satisfies the condition: £ n V n £ a = 0. The property of being a geodesic 
is not, in general, preserved under reparametrization. So it docs not transfer 
to curve images. But, of course, the related property of being a geodesic up 
to reparametrization does carry over. (The latter holds of a curve if it can be 
reparametrized so as to be a geodesic.) 

Now we can state the first three interpretive principles. For all curves 7 : 
I —> M, 

CI 7 is timclike iff its image 7 [7] could be the worldline of a massive point 
particle (i.e., a particle with positive mass); 10 

7 If I is not an open set, we can understand smoothness to mean that there is an open 
interval / CI, with I <Z I, and a smooth map 7: I — > M, such that 7(3) = j(s) for all s£/. 

8 This follows from the fact that, in the case just described, 72 = 71, with ^ > 0. 

9 The difference between curves and curve images, i.e., between maps 7: / — > M and sets 
7[7], matters. We take worldlines to be instances of the latter, i.e., construe them as point 
sets rather than parametrized point sets. 

10 We will later discuss the concept of mass in relativity theory. For the moment, we take it 
to be just a primitive attribute of particles. 
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C2 7 can be reparametrized so as to be a null geodesic iff 7 [7] could be the 
trajectory of a light ray; 11 

PI 7 can be reparametrized so as to be a timclike geodesic iff 7 [J] could be 
the worldline of a free 12 massive point particle. 

In each case, a statement about geometric structure (on the left) is correlated 
with a statement about the behavior of particles or light rays (on the right). 

Several comments and qualifications are called for. First, we are here work- 
ing within the framework of relativity as traditionally understood, and ignoring 
speculations about the possibility of particles ("tachyons") that travel faster 
than light. (Their worldlines would come out as images of spacelike curves.) 
Second, we have built in the requirement that "curves" be smooth. So, depend- 
ing on how one models collisions of point particles, one might want to restrict 
attention here to particles that do not experience collisions. 

Third, the assertions require qualification because the status of "point par- 
ticles" in relativity theory is a delicate matter. At issue is whether one treats a 
particle's own mass-energy as a source for the surrounding metric field g a b ~ in 
addition to other sources that may happen to be present. (Here we anticipate 
our discussion of Einstein's equation.) If one does, then the curvature associated 
with g a b may blow up as one approaches the particle's worldline. And in this 
case one cannot represent the worldline as the image of a curve in M, at least 
not without giving up the requirement that g a b be a smooth field on M . For 
this reason, a more careful formulation of the principles would restrict attention 
to "test particles", i.e., ones whose own mass-energy is negligible and may be 
ignored for the purposes at hand. 

Fourth, the modal character of the assertions (i.e., the reference to possibility) 
is essential. It is simply not true, to take the case of CI, that all timelike curve 
images are, in fact, the worldlines of massive particles. The claim is that, as 

11 For certain purposes, even within classical relativity theory, it is useful to think of light 
as constituted by streams of "photons", and take the right side condition here to be "7 [J] 
could be the worldline of a photon" . The latter formulation makes C2 look more like CI and 
PI, and draws attention to the fact that the distinction between massive particles and mass 
particles (such as photons) has direct significance in terms of relativistic spacetime structure. 

12 "p ree particles" here must be understood as ones that do not experience any forces (except 
"gravity"). It is one of the fundamental principles of relativity theory that gravity arises as 
a manifestation of spacetime curvature, not as an external force that deflects particles from 
their natural, straight (geodesic) trajectories. We will discuss this matter further in section 
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least so far as the laws of relativity theory are concerned, they could be. Of 
course, judgments concerning what could be the case depend on what conditions 
are held fixed in the background. The claim that a particular curve image could 
be the worldline of a massive point particle must be understood to mean that 
it could so long as there are, for example, no barriers in the way. Similarly, 
in C2 there is an implicit qualification. We are considering what trajectories 
are available to light rays when no intervening material media are present, i.e., 
when we are dealing with light rays in vacua. 

Though these four concerns are important and raise interesting questions 
about the role of idealization and modality in the formulation of physical theory, 
they have little to do with relativity theory as such. Similar difficulties arise 
when one attempts to formulate corresponding principles within the framework 
of Newtonian gravitation theory. 

It follows from the cited interpretive principles that the metric g a b is deter- 
mined (up to a constant) by the behavior of point particles and light rays. We 
make this claim precise in a pair of propositions about "conformal structure" 
and "projective structure" . 

Let g a b be a second smooth metric of Lorentz signature on M. We say that 
cjab is conformally equivalent to g a b if there is a smooth map Q : M — > M on M 
such that (jab = Q 2 9ab- is called a conformal factor. It certainly need not be 
constant.) Clearly, if g ao and g a b are conformally equivalent, then they agree in 
their classification of vectors and curves as timelike, null, etc.. The converse is 
true as well. 13 Conformally equivalent metrics on M do not agree, in general, 
as to which curves on M qualify as geodesies or even just as geodesies up to 
reparametrization. But, it turns out, they do necessarily agree as to which null 
curves are geodesies up to reparametrization. 14 And the converse is true, once 

13 If the two metrics agree as to which vectors and curves belong to any one of the three 
categories, then they must agree on all. And in that case, they must be conformally equivalent. 
See Hawking and Ellis 119721 p. 61] . 

14 This follows because the property of being the image of a null geodesic can be captured 
in terms of the existence or non-existence of (local) timelike and null curves connecting points 
in M . The relevant technical lemma can be formulated as follows. 

A curve y. I —> M can be reparametrized so as to be a null geodesic iff 7 is null 
and for all s £ /, there is an open set O C M containing 7(3) such that, for all 
si , S2 G /, if s\ < s < S2, and if 7([si, S2]) C O, then there is no timelike curve 
from 7(^1) to 7(^2) within O. 

(Here 7([si,S2]) is the image of 7 as restricted to the interval [si, S2].) For a proof, see 
Hawking and Ellis [15721 p. 103]. 
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again. 5 

Putting the pieces together, we have the following proposition. Clauses (1) 
and (2) correspond to CI and C2 respectively. 

Proposition 2.1.1. Let g a b be a second smooth metric of Lorentz signature on 
M . Then the following conditions are equivalent. 

(1) 9ab and g a b agree as to which curves on M are timelike. 

(2) g a b and g a b agree as to which curves on M can be reparameterized so as 
to be null geodesies. 

(3) g a b and g^ are conformally equivalent. 

In this sense, the spacetime metric g a b is determined up to a conformal factor, 
independently, by the set of possible worldlines of massive point particles, and 
by the set of possible trajectories of light rays. 

Next we turn to projective structure. Let V a be a second derivative operator 
on M. We say that V a and V a are projectively equivalent if they agree as to 
which curves are geodesies up to reparametrization (i.e., if, for all curves 7, 7 
can be reparametrized so as to be a geodesic with respect to V iff it can be 
so reparametrized with respect to V a ). And if g a b is a second metric on M of 
Lorentz signature, we say that it is projectively equivalent to g a b if its associated 
derivative operator V a is projectively equivalent to V a . 

It is a basic result, due to Hermann Weyl |1921| . that if g a b and g a b are 
conformally and projectively equivalent, then the conformal factor that relates 
them must be constant. It is convenient for our purposes, with interpretive 
principle PI in mind, to cast it in a slightly altered form that makes reference 
only to timelike geodesies (rather than arbitrary geodesies). 

Proposition 2.1.2. Let g a b be a second smooth metric on M with g a b = fl 2 g a b- 
If Sab and g a b agree as to which timelike curves can be reparametrized so as to 
be geodesies, then Q is constant. 

The spacetime metric g a b, we saw, is determined up to a conformal factor, 
independently, by the set of possible worldlines of massive point particles, and 
by the set of possible trajectories of light rays. The proposition now makes 
clear the sense in which it is fully determined (up to a constant) by those sets 
together with the set of possible worldlines of free massive particles. 16 

15 For if the metrics agree as to which curves are null geodesies up to reparametrization, 
they must agree as to which vectors at arbitrary points are null, and this, we know, implies 
that the metrics are conformally equivalent. 

16 As Weyl put it p. 103], 
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Our characterization of relativistic spacetimes is extremely loose. Many fur- 
ther conditions might be imposed. For the moment, we consider just one. 

(M, g a b) is said to be temporally orientable if there exists a continuous time- 
like vector field r a on M. Suppose the condition is satisfied. Then two such 
fields r a and f a on M are said to be co-oriented if r a f a > everywhere, i.e., 
if r a and f a fall in the same lobe of the null-cone at every point of M. Co- 
orientation is an equivalence relation (on the set of continuous timelike vector 
fields on M) with two equivalence classes. A temporal orientation of (M, g a b) 
is a choice of one of those two equivalence classes to count as the "future" one. 
Thus, a non-zero causal vector £ a at a point of M is said to be future directed or 
past directed with respect to the temporal orientation T depending on whether 
T a £,a > or T a £ Q < at the point, where r a is any continuous timelike vector 
field in T. Derivatively, a causal curve 7: / — > M is said to be future directed 
(resp. past directed) with respect to T if its tangent vectors at every point are. 

In what follows, we assume that our background spacetime (M, g a b) is tem- 
porally orientable, and that a particular temporal orientation has been specified. 
Also, given events p and q in M, we write p <ti q (resp. p < q) if there is a 
future-directed timelike (resp. causal) curve that starts at p and ends at q. 17 

2.2 Proper Time 

So far we have discussed relativistic spacetime structure without reference to 
cither "time" or "space". We come to them in this section and the next. 

Let 7 : [si, S2] — > M be a future-directed timelike curve in M with tangent 
field £°. We associate with it an elapsed proper time (relative to g a b) given by 



... it can be shown that the metrical structure of the world is already fully 
determined by its inertial and causal structure, that therefore mensuration need 
not depend on clocks and rigid bodies but that light signals and mass points 
moving under the influence of inertia alone will suffice. 

(For more on Weyl's "causal-inertial" method of determining the spacetime metric, see Cole- 
man and Korte 120011 section 4.9].) 

17 It follows immediately that if p <C q, then p < q. The converse does not hold, in general. 
But the only way the second condition can be true, without the first being true as well, is if 
the only future-directed causal curves from p to q are null geodesies (or reparamctrizations of 
null geodesies). See Hawking and Ellis 119721 p. 112]. 
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This elapsed proper time is invariant under reparametrization of 7, and is just 
what we would otherwise describe as the length of (the image of) 7. The fol- 
lowing is another basic principle of relativity theory. 

P2 Clocks record the passage of elapsed proper time along their worldlines. 

Again, a number of qualifications and comments are called for. Our formu- 
lation of CI, C2, and PI was rough. The present formulation is that much 
more so. We have taken for granted that we know what "clocks" are. We have 
assumed that they have worldlines (rather than worldtubes). And we have over- 
looked the fact that ordinary clocks (e.g., the alarm clock on the nightstand) do 
not do well at all when subjected to extreme acceleration, tidal forces, and so 
forth. (Try smashing the alarm clock against the wall.) Again, these concerns 
are important and raise interesting questions about the role of idealization in 
the formulation of physical theory. (One might construe an "ideal clock" as a 
point-sized test object that perfectly records the passage of proper time along 
its worldline, and then take P2 to assert that real clocks are, under appropriate 
conditions, to varying degrees of accuracy, approximately ideal.) But as with 
our concerns about the status of point particles, they do not have much to do 
with relativity theory as such. Similar ones arise when one attempts to for- 
mulate corresponding principles about clock behavior within the framework of 
Newtonian theory. 

Now suppose that one has determined the conformal structure of spacetime, 
say, by using light rays. Then one can use clocks, rather than free particles, 
to determine the conformal factor. One has the following simple result, which 
should be compared with proposition 12 . 1 . 2l 18 

Proposition 2.2.1. Let g a b be a second smooth metric on M with g a b = fl 2 g a i,. 
Further suppose that the two metrics assign the same lengths to all timelike 
curves, i.e., Y)\g ab — \l\g ab for all timelike curves 7 : I — > M. Then 17 = 1 
everywhere. (Here \~f\g ah is the length 0/7 relative to g a b-) 

P2 gives the whole story of relativistic clock behavior (modulo the concerns 
noted above). In particular, it implies the path dependence of clock readings. If 
two clocks start at an event p, and travel along different trajectories to an event 

18 Here we not only determine the metric up to a constant, but determine the constant as 
well. The difference is that here, in effect, we have built in a choice of units for spacetime 
distance. We could obtain a more exact counterpart to proposition 12. 1 .21 if we worked, not 
with intervals of elapsed proper time, but rather with ratios of such intervals. 
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q, then, in general, they will record different elapsed times for the trip. (E.g., 
one will record an elapsed time of 3,806 seconds, the other 649 seconds.) This 
is true no matter how similar the clocks are. (We may stipulate that they came 
off the same assembly line.) This is the case because, as P2 asserts, the elapsed 
time recorded by each of the clocks is just the length of the timelike curve it 
traverses in getting from p to q and, in general, those lengths will be different. 

Suppose we consider all future-directed timelike curves from p to q. It is 
natural to ask if there are any that minimize or maximize the recorded elapsed 
time between the events. The answer to the first question is 'no'. Indeed, one 
has the following proposition. 

Proposition 2.2.2. Let p and q be events in M such that p <C q- Then, for all 
e > 0, there exists a future- directed timelike curve 7 from p to q with \j\ < e. 
(But there is no such curve with length 0, since all timelike curves have non-zero 
length.) 

Though some work is required to give the proposition an honest proof (see 
O'Neill [19831 pp. 294-5]), it should seem intuitively plausible. If there is a 
timelike curve connecting p and q, there also exists a jointed, zig-zag null curve 
that connects them. It has length 0. But we can approximate the jointed null 



long timelike curve 



short timelike curve 



Figure 2.2.1: A long timelike curve from p to q and a very 
short one that swings back- and- forth, and approximates a 
broken null curve. 



curve arbitrarily closely with smooth timelike curves that swing back and forth. 
So (by the continuity of the length function), we should expect that, for all 
e > 0, there is an approximating timelike curve that has length less than e. (See 
figure EH ) 

The answer to the second question (Can one maximize recorded elapsed time 
between p and ql) is 'yes' if one restricts attention to local regions of spacetime. 
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In the case of positive definite metrics, i.e., ones with signature of form (n, 0), 
we know, geodesies are locally shortest curves. The corresponding result for 
Lorentz metrics is that timelike geodesies are locally longest curves. 

Proposition 2.2.3. Let 7 : I — > M be a future- directed timelike curve. Then 
7 can be reparametrized so as to be a geodesic iff for all s € I, there exists an 
open set O containing j(s) such that, for all si, S2 € / with s% < s < S2, if the 
image 0/7 = 7^ S2] is contained in O, then 7 (and its reparametrizations) are 
longer than all other timelike curves in O from j(si) to 7(52). (Here 7^ ] is 
the restriction 0/7 to the interval [si,s^].) 

The proof of the proposition is very much the same as in the positive definite 
case. (See Hawking and Ellis |1972l p. 105].) Thus of all clocks passing locally 
from p to q, that one will record the greatest elapsed time that "falls freely" 
from p to q. To get a clock to read a smaller elapsed time than the maximal 
value one will have to accelerate the clock. Now acceleration requires fuel, and 
fuel is not free. So proposition 12 . 2 . 3l has the consequence that (locally) "saving 
time costs money". And proposition 12 . 2 . 2l may be taken to imply that (locally) 
"with enough money one can save as much time as one wants" . 

The restriction here to local regions of spacetime is essential. The connection 
described between clock behavior and acceleration does not, in general, hold on 
a global scale. In some relativistic spacetimes, one can find future-directed 
timelike geodesies connecting two events that have different lengths, and so 
clocks following the curves will record different elapsed times between the events 
even though both are in a state of free fall. Furthermore - this follows from the 
preceding claim by continuity considerations alone - it can be the case that 
of two clocks passing between the events, the one that undergoes acceleration 
during the trip records a greater elapsed time than the one that remains in a 
state of free fall. 

The connection we have been considering between clock behavior and accel- 
eration was once thought to be paradoxical. (I am thinking of the "clock (or 
twin) paradox".) Suppose two clocks, A and B, pass from one event to another 
in a suitably small region of spacetime. Further suppose A does so in a state of 
free fall, but B undergoes acceleration at some point along the way. Then, we 
know, A will record a greater elapsed time for the trip than B. This was thought 
paradoxical because it was believed that "relativity theory denies the possibility 
of distinguishing "absolutely" between free fall motion and accelerated motion" . 
(If we are equally well entitled to think that it is clock B that is in a state of free 
fall, and A that undergoes acceleration, then, by parity of reasoning, it should 
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be B that records the greater elapsed time.) The resolution of the paradox, if 
one can call it that, is that relativity theory makes no such denial. The situa- 
tions of A and B here are not symmetric. The distinction between accelerated 
motion and free fall makes every bit as much sense in relativity theory as it does 
in Newtonian physics. 

In what follows, unless indication is given to the contrary, a "timelike curve" 
should be understood to be a future-directed timelike curve, parametrized by 
elapsed proper time, i.e., by arc length. In that case, the tangent field £ a of 
the curve has unit length (£ a £ a = 1). And if a particle happens to have the 
image of the curve as its worldline, then, at any point, £ a is called the particle's 
four- velocity there. 



2.3 Space/Time Decomposition at a Point and Particle 
Dynamics 

Let 7 be a timelike curve representing the particle O with four- velocity field £ a . 
Let p be a point on the image of 7, and let A a be a vector at p. There is a 
natural decomposition of A a into components parallel to, and orthogonal to, £ a : 



A a = (A b 6)r + (A a -(A%)n - a) 

parallel to £ a orthogonal to £ a 

These are standardly interpreted, respectively, as the "temporal" and "spatial" 
components of A a (relative to £ a ). In particular, the three-dimensional subspace 
of M p consisting of vectors orthogonal to £ a is interpreted as the "infinitesimal" 
simultaneity slice of O at p. 19 If we introduce the tangent and orthogonal 
projection operators 

Kb = 6.6, (2) 
hab = Sab ~ 6j 6 (3) 

then the decomposition can be expressed in the form 

X a = k a b X b + h a b X b . (4) 

We can think of k ab and h ab as the relative temporal and spatial metrics deter- 
mined by £ a . They are symmetric and satisfy 

k\k\ = fc« (5) 
h\h\ = h\. (6) 

19 Here we simply take for granted the standard identification of "relative simultaneity" with 
orthogonality. We will return to consider its justification in section b.ll 
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Many standard textbook assertions concerning the kinematics and dynamics 
of point particles can be recovered using these decomposition formulas. For ex- 
ample, suppose that the worldline of a second particle O also passes through p 
and that its four- velocity at p is £ a . (Since £ a and £ a are both future-directed, 
they are co-oriented, i.e., (£ a £ a ) > 0.) We compute the speed of O as deter- 
mined by O. To do so, we take the spatial magnitude of £° relative to O and 
divide by its temporal magnitude relative to O: 

II /j a £&|| 

v = speed of O relative to O — - — ^4-^. (7) 

(Given any vector fj, a , we understand to be (/x a /^ a )^ if /i a is causal, and 

(— p. a p, a )^ if h is spacelike.) From @, 121, ©, an d ©, we have 

linfll = (k\I b kae^ = (k b ct"I C ) h = (? 6 6) (8) 

and 

linfll = (-nC^acD 4 = (-h b ct b t C ) L * = ((C"6) 2 - 1)*. (9) 

So 

Ba ^)'-1)* <L (1Q) 

Thus, as measured by O, no massive particle can ever attain the maximal speed 
1. (A similar calculation would show that, as determined by O, light always 
travels with speed 1.) For future reference, we note that 1)10(1 implies: 

It is a basic fact of relativistic life that there is associated with every 
point particle, at every event on its worldline, a four-momentum (or energy- 
momentum) vector P a . In the case of a massive particle with four-velocity £ a , 
P a is proportional to £°, and the (positive) proportionality factor is just what 
we would otherwise call the mass (or rest mass) m of the particle. So we have 
P a = m £ a . In the case of a "photon" (or other mass particle), no such 
characterization is available because its worldline is the image of a null (rather 
than timelike) curve. But we can still understand its four-momentum vector at 
the event in question to be a future-directed null vector that is tangent to its 
worldline there. If we think of the four-momentum vector P a as fundamental, 
then we can, in both cases, recover the mass of the particle as the length of P a : 
m = (P a P a )^ . (It is strictly positive in the first case, and in the second.) 
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Now suppose a massive particle O has four- velocity £ a at an event, and 
another particle, either a massive particle or a photon, has four-momentum P a 
there. We can recover the usual expressions for the energy and three-momentum 
of the second particle relative to O if we decompose P a in terms of £ a . By J3J 
and (J2J), we have 

P a = (P b £ b )C + h\P b . (12) 

The energy relative to O is the coefficient in the first term: E — P h £,b- In the 
case of a massive particle where P a = m£ a , this yields, by l|ll|) . 

E = m (i b a b ) = —^=. (i3) 

(If we had not chosen units in which c = 1, the numerator in the final expression 
would have been mc 2 and the denominator \Jl — ^.) The three-momentum 
relative to O is the second term in the decomposition, i.e., the component of P a 
orthogonal to £ a : h a b P b . In the case of a massive particle, by © and (|llfl . it 
has magnitude 

p = linmfH = m ((f 6) 2 - 1)* = ^==f- ( 14 ) 

Interpretive principle PI asserts that free particles traverse the images of 
timelike geodesies. It can be thought of as the relativistic version of Newton's 
first law of motion. Now we consider acceleration and the relativistic version of 
the second law. Let 7 : / — > M be a timelike curve whose image is the worldline 
of a massive particle O, and let £ a be the four- velocity field of O. Then the 
four- acceleration (or just acceleration) field of O is £ n Vn £ a , i.e., the directional 
derivative of £ a in the direction £ a . The four- acceleration vector is orthogonal 

toe- (This is clear, since e(eV„Ca) - §rV„(£ a £a) = ^ n Vn(l) = 0.) 
The magnitude ||£"^n£ a || °f the four-acceleration vector at a point is just 
what we would otherwise describe as the Gaussian curvature of 7 there. It is a 
measure of the degree to which 7 curves away from a straight path. (And 7 is 
a geodesic precisely if its curvature vanishes everywhere.) 

The notion of spacetime acceleration requires attention. Consider an exam- 
ple. Suppose you decide to end it all and jump off the Empire State Building. 
What would your acceleration history be like during your final moments? One is 
accustomed in such cases to think in terms of acceleration relative to the earth. 
So one would say that you undergo acceleration between the time of your jump 
and your calamitous arrival. But on the present account, that description has 
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things backwards. Between jump and arrival you are not accelerating. You are 
in a state of free fall and moving (approximately) along a spacetime geodesic. 
But before the jump, and after the arrival, you are accelerating. The floor of the 
observation desk, and then later the sidewalk, push you away from a geodesic 
path. The all-important idea here is that we are incorporating the "gravita- 
tional field" into the geometric structure of spacetime, and particles traverse 
geodesies if and only if they are acted upon by no forces "except gravity" . 

The acceleration of any massive particle, i.e., its deviation from a geodesic 
trajectory, is determined by the forces acting on it (other than "gravity"). If 
the particle has mass m > 0, and the vector field F a on 7 [J] represents the 
vector sum of the various (non-gravitational) forces acting on the particle, then 
the particle's four-acceleration £ ra V n £ a satisfies: 

F a =m£ n V w f a . (15) 

This is our version of Newton's second law of motion. 

Consider an example. Electromagnetic fields are represented by smooth, anti- 
symmetric fields F ao . (Here "anti-symmetry" is the condition that F ba = —Fab.) 
If a particle with mass m > 0, charge q, and four-velocity field £ a is present, 
the force exerted by the field on the particle at a point is given by qF a b ^ b . If 
we use this expression for the left side of Ijl5|) . we arrive at the Lorentz law of 
motion for charged particles in the presence of an electromagnetic field: 

q F%e = ™ev b c 20 (i6) 

2.4 Matter Fields 

In classical relativity theory, one generally takes for granted that all that there 
is, and all that happens, can be described in terms of various matter fields, e.g., 
material fluids and electromagnetic fields. 21 Each such field is represented by 
one or more smooth tensor (or spinor) fields on the spacetime manifold M. Each 
is assumed to satisfy field equations involving the fields that represent it and 
the spacetime metric g a b- 

20 Notice that the equation makes geometric sense. The acceleration vector on the right 
is orthogonal to £ a . But so is the force vector on the left since £ a (F a b £ 6 ) = £ a £ b F a i, = 
1 £a£b (p ab + p ba ^ and by the anti-symmetry of F ab , (F ab + F ba ) = 0. 

21 This being the case, the question arises how (or whether) one can adequately recover talk 
about "point particles" in terms of the matter fields. We will say just a bit about the question 
in this section. 
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For present purposes, the most important basic assumption about the matter 
fields is the following. 

Associated with each matter field J- is a symmetric smooth tensor 
field T a b characterized by the property that, for all points p in M, 
and all future-directed, unit timelike vectors £ a at p, T a b £ b is the 
four-momentum density of T at p as determined relative to £ a . 

T a b is called the energy-momentum field associated with T . The four-momentum 
density vector T a b £ b at p can be further decomposed into its temporal and spa- 
tial components relative to £°, just as the four-momentum of a massive particle 
was decomposed in the preceding section. The coefficient of £ a in the first com- 
ponent, T a b £ a £ b , is the energy density of T at p as determined relative to £ a . 
The second component, T„b (g an — £ a £" ) £ & , is the three-momentum density of 
T at p as determined relative to £ a . 

Other assumptions about matter fields can be captured as constraints on the 
energy-momentum tensor fields with which they are associated. Examples are 
the following. (Suppose T ao is associated with matter field J 7 .) 

Weak Energy Condition: Given any future-directed unit timelike vector £ a 
at any point in M, T ab £ a £ b > 0. 

Dominant Energy Condition: Given any future-directed unit timelike vec- 
tor £ a at any point in M, T ab £ a £; b > and T a b £ h is timelike or null. 

Conservation Condition: V a T ab = at all points in M. 

The first asserts that the energy density of T, as determined by any observer 
at any point, is non-negative. The second adds the requirement that the four- 
momentum density of T, as determined by any observer at any point, is a 
future-directed causal (i.e., timelike or null) vector. It captures the condition 
that there is an upper bound to the speed with which energy-momentum can 
propagate (as determined by any observer). It captures something of the flavor 
of principle CI in section but avoids reference to "point particles". 22 

The conservation condition, finally, asserts that the energy-momentum car- 
ried by T is locally conserved. If two or more matter fields are present in the 

22 This is the standard formulation of the dominant energy condition. The fit with CI would 
be even closer if we strengthened the condition slightly so as to be appropriate, specifically, 
for massive matter fields: at any point p in M, if T a b ^ there, then T a b (; b is timelike for all 
future-directed unit timelike vectors § a at p. 
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same region of spacetime, it need not be the case that each one individually 
satisfies the condition. Interaction may occur. But it is a fundamental assump- 
tion that the composite energy-momentum field formed by taking the sum of 
the individual ones satisfies it. Energy-momentum can be transferred from one 
matter field to another, but it cannot be created or destroyed. 

The dominant energy and conservation conditions have a number of joint 
consequences that support the interpretations just given. We mention two. The 
first requires a preliminary definition. 

Let (M, g a b) be a fixed relativistic spacetime, and let S be an achronal subset 
of M (i.e., a subset in which there do not exist points p and q such that p -C q). 
The domain of dependence D(S) of S is the set of all points p in M with this 
property: given any smooth causal curve without (past or future) endpoint, 23 if 
(its image) passes through p, then it necessarily intersects S. For all standard 
matter fields, at least, one can prove a theorem to the effect that "what happens 
on S fully determines what happens throughout D(S)" . (See Earman (this 
volume, chapter 15).) Here we consider just a special case. 

Proposition 2.4.1. Let S be an achronal subset of M. Further let T a \, be 
a smooth symmetric field on M that satisfies both the dominant energy and 
conservation conditions. Finally, assume T a (, = on S . Then T a b = on all of 
D(S). 

The intended interpretation of the proposition is clear. If energy-momentum 
cannot propagate (locally) outside the null-cone, and if it is conserved, and if 
it vanishes on S, then it must vanish throughout D(S). After all, how could it 
"get to" any point in D(S)1 Note that our formulation of the proposition does 
not presuppose any particular physical interpretation of the symmetric field T a i,. 
All that is required is that it satisfy the two stated conditions. (For a proof, see 
Hawking and Ellis (13721 p. 94].) 

The next proposition (Geroch and Jang [1975] ) shows that, in a sense, if one 
assumes the dominant energy condition and the conservation condition, then 
one can prove that free massive point particles traverse the images of timelike 
geodesies. (Recall principle PI in section 1231 ) The trick is to find a way to talk 
about "point particles" in the language of extended matter fields. 

23 Let 7 : Z — > M be a smooth curve. We say that a point p in M is a future- endpoint of 7 
if, for all open sets O containing p, there exists an so in I such that for all s £ I, if s > so, 
then 7(5) £ O, i.e., the image of 7 eventually enters and remains in O. (Past-endpoints are 
defined similarly.) 
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Proposition 2.4.2. Let 7 : I — » M be smooth curve. Suppose that given any 
open subset O of M containing "f[I], there exists a smooth symmetric field T a b 
on M such that: 

(1) T a b satisfies the dominant energy condition; 

(2) T a b satisfies the conservation condition; 

(3) Tab = outside of O; 

(4) Tab 7^ at some point in O. 

Then 7 is timelike, and can be reparametrized so as to be a geodesic. 

The proposition might be paraphrased this way. If a smooth curve in space- 
time is such that arbitrarily small free bodies could contain the image of the 
curve in their worldtubes, then the curve must be a timelike geodesic (up to 
reparametrization) . In effect, we are trading in "point particles" in favor of 
nested convergent sequences of smaller and smaller extended particles. (Bodies 
here are understood to be "free" if their internal energy-momentum is conserved. 
If a body is acted upon by a field, it is only the composite energy-momentum 
of the body and field together that is conserved.) 

Note that our formulation of the proposition takes for granted that we can 
keep the background spacetime structure (M, g a b) fixed while altering the fields 
T a i, that live on M. This is justifiable only to the extent that, in each case, T a b is 
understood to represent a test body whose effect on the background spacetime 
structure is negligible. 24 Note also that we do not have to assume at the outset 
that the curve 7 is timelike. That follows from the other assumptions. 

We have here a precise proposition in the language of matter fields that, 
at least to some degree, captures principle PI (concerning the behavior of free 
massive point particles). Similarly, it is possible to capture C2 (concerning the 
behavior of light) with a proposition about the behavior of solutions to Maxwell's 
equations in a limiting regime ( "the geometrical limit" ) where wavelengths are 
small. It asserts, in effect, that when one passes to this limit, packets of electro- 
magnetic waves are constrained to move along (images of) null geodesies. (See 
Wald [T3B51 p. 71].) 

Now we consider an example. Perfect fluids are represented by three objects: 
a four- velocity field r) a , an energy density field p, and an isotropic pressure field 

24 Stronger theorems have been proved (see Ehlers and Geroch 120041 *1 in which it is not 
required that the perturbative effect of the extended body disappear entirely at each stage of 
the limiting process, but only that, in a certain sense, it disappear in the limit. 
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p (the latter two as determined by a "co- moving" observer at rest in the fluid). 
In the special case where the pressure p vanishes, one speaks of a dust field. Par- 
ticular instances of perfect fluids are characterized by "equations of state" that 
specify p as a function of p. (Specifically excluded here are such complicating 
factors as anisotropic pressure, shear stress, and viscosity.) Though p is gener- 
ally assumed to be non-negative (see below), some perfect fluids (e.g., to a good 
approximation, water) can exert negative pressure. The energy-momentum ten- 
sor field associated with a perfect fluid is: 

Tab = pr] a r] b -p (g ab - r) a r] b ). (17) 

Notice that the energy- momentum density vector of the fluid at any point, 
as determined by a co-moving observer (i.e., as determined relative to ?y a ), is 
T a b rj b = prj a . So we can understand p, cquivalcntly, as the energy density of 
the fluid relative to rj a , i.e., T ab r\ a r/ b , or as the (rest) mass density of the fluid, 
i.e., the length of prf . (Of course, the situation here corresponds to that of a 
point particle with mass m and four- velocity 77°, as considered in section 12. 31 1 

In the case of a perfect fluid, the weak energy condition (WEC), dominant 
energy condition (DEC), and conservation condition (CC) come out as follows. 

WEC p > and p > -p 

DEC p > and p > p > -p 



CC 



(p + p)r] b ^bv a - (g ab ~V a V b )^bP = 

ry b V fc p + (p + p)(V bV b ) = 



Consider the two equations jointly equivalent to the conservation condition. 
The first is the equation of motion for a perfect fluid. We can think of it as a 
relativistic version of Euler's equation. The second is an equation of continuity 
(or conservation) in the sense familiar from classical fluid mechanics. It is easiest 
to think about the special case of a dust field (p = 0). In that case, the equation 
of motion reduces to the geodesic equation: rj b W b rj a = 0. That makes sense. 
In the absence of pressure, particles in the fluid arc free particles. And the 
conservation equation reduces to: r] b V b p + p (Vh r/ b ) = 0. The first term gives 
the instantaneous rate of change of the fluid's energy density, as determined by 
a co-moving observer. The term V& gives the instantaneous rate of change of 

its volume, per unit volume, as determined by that observer. In a more familiar 

• i 1 • dp p dV -11 

notation, the equation might be written 1 = or, equivalcntly, 

ds V ds 

— ^ — - = 0. (Here we use s for elapsed proper time.) It asserts that (in 
ds 
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the absence of pressure, as determined by a co-moving observer) the energy 
contained in an (infinitesimal) fluid blob remains constant, even as its volume 
changes. 

In the general case, the situation is more complex because the pressure in the 
fluid contributes to its energy (as determined relative to particular observers), 
and hence to what might be called its "effective mass density" . (If you compress 
a fluid blob, it gets heavier.) In this case, the WEC comes out as the requirement 
that (p+p) > in addition to p > 0. If we take h ab = (g ab — r\ a r] b ), the equation 
of motion can be expressed as: 

(p + p)v b V bV a = h ab W b p. 

This is an instance of the "second law of motion" l|15|l as applied to an (infinites- 
imal) blob of fluid. On the left we have: "effective mass density x acceleration" . 
On the right, we have the force acting on the blob. We can think of it as minus 25 
the gradient of the pressure (as determined by a co-moving observer). Again, 
this makes sense. If the pressure on the left side of the blob is greater than that 
on the right, it will move to the right. The presence of the non- vanishing term 
(p Vb?7 b ) in the conservation equation is now required because the energy of the 
blob is not constant when its volume changes as a result of the pressure. The 
equation governs the contribution made to its energy by pressure. 

2.5 Einstein's Equation 

Once again, let (M, g ab ) be our background relativistic spacetime with a speci- 
fied temporal orientation. 

It is one of the fundamental ideas of relativity theory that spacetime structure 
is not a fixed backdrop against which the processes of physics unfold, but instead 
participates in that unfolding. It posits a dynamical interaction between the 
spacetime metric in any region and the matter fields there. The interaction is 
governed by Einstein's field equation: 

Rab - 2 R 9ab - ^gab = &nT ab , (18) 

or, equivalently, 

Rab = 8-K(T ab - -T g ab ) - \g ab . (19) 

25 The minus sign comes in because of our sign conventions. 
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Here A is the cosmological constant, R a b{— R n a bn) ' IS the Ricci tensor field, 
R {= R a a ) is the Riemann scalar curvature field, and T is the contracted field 
T a a , 26 We start with four remarks about and then consider an alternative 
formulation that provides a geometric interpretation of sorts. 

(1) It is sometimes taken to be a version of "Mach's principle" that "the 
spacetime metric is uniquely determined by the distribution of matter" . And 
it is sometimes proposed that the principle can be captured in the requirement 
that "if one first specifies the energy-momentum distribution T a b on the space- 
time manifold M , then there is exactly one (or at most one) Lorentzian metric 
g a b on M that, together with T a t, satisfies (|T%|) ". But there is a serious prob- 
lem with the proposal. In general, one cannot specify the energy-momentum 
distribution in the absence of a spacetime metric. Indeed, in typical cases the 
metric enters explicitly in the expression for T^. (Recall the expression <|17fl 
for a perfect fluid.) Thus, in looking for solutions to <(T%|) . one must, in general, 
solve simultaneously for the metric and matter field distribution. 

(2) Given any smooth metric g a b on M, there certainly exists a smooth 
symmetric field T a b on M that, together with g a b, is a solution to (|18|) . It suffices 
to define T a b by the left side of the equation. But the field T a b so introduced 
will not, in general, be the energy-momentum field associated with any known 
matter field. It will not even satisfy the weak energy condition discussed in 
section EP1 With the latter constraint on T a \, in place, Einstein's equation is an 
entirely non-trivial restriction on spacetime structure. 

Discussions of spacetime structure in classical relativity theory proceed on 
three levels according to the stringency of the constraints imposed on T a b. At 
the first level, one considers only "exact solutions", i.e., solutions where T a f, 
is, in fact, the aggregate energy-momentum field associated with one or more 
known matter fields. So, for example, one might undertake to find all perfect 
fluid solutions exhibiting particular symmetries. At the second level, one con- 
siders the larger class of what might be called "generic solutions", i.e., solutions 
where T a b satisfies one or more generic constraints (of which the weak and dom- 
inant energy conditions are examples). It is at this level, for example, that the 
singularity theorems of Penrose and Hawking (Hawking and Ellis |1972p are 
proved. Finally, at the third level, one drops all restrictions on T a b, and Ein- 
stein's equation plays no role. Many results about global structure are proved 
at this level, e.g., the assertion that there exist closed timelike curves in any 

26 We use "geometrical units" in which the gravitational constant G, as well as the speed of 
light c, is 1. 
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relativistic spacetime (M, g a b) where M is compact. 

(3) The role played by the cosmological constant in Einstein's equation re- 
mains a matter of controversy. Einstein initially added the term {—\g a b) in 1917 
to allow for the possibility of a static cosmological model (which, at the time, 
was believed necessary to properly represent the actual universe). 27 But there 
were clear problems with doing so. In particular, one does not recover Poisson's 
equation (the field equation of Newtonian gravitation theory) as a limiting form 
of Einstein's equation unless A = 0. (See point (4) below.) Einstein was quick to 
revert to the original form of the equation after Hubble's redshift observations 
gave convincing evidence that the universe is, in fact, expanding. (That the 
theory suggested the possibility of cosmic expansion before those observations 
must count as one of its great successes.) Since then the constant has often 
been reintroduced to help resolve discrepancies between theoretical prediction 
and observation, and then abandoned when the (apparent) discrepancies were 
resolved. The controversy continues. Recent observations indicating an acceler- 
ating rate of cosmic expansion have led many cosmologists to believe that our 
universe is characterized by a positive value for A. (See Earman |2()()1| for an 
overview.) 

Claims about the value of the cosmological constant are sometimes cast as 
claims about the "energy-momentum content of the vacuum" . This involves 
bringing the term (— Xg a b) from the left side of equation (|T%)) to the right, and 
re-interpreting it as an energy- momentum field, i.e., taking Einstein's equation 
in the form 

Rab - \ R9ab = 8 7T (T ab + T^ AC ), (20) 

where TY h AC — — g a b. Here T a b is still understood to represent the aggregate 
energy- momentum of all normal matter fields. But T^ b AC is now understood to 
represent the residual energy-momentum associated with empty space. Given 
any unit timclike vector £ a at a point, {T^ b AC £° £ b ) is — . So, on this re- 
interpretation, A comes out (up to the factor 87r) as the energy- density of the 
vacuum as determined by any observer, at any point in spacetime. 

It should be noted that there is a certain ambiguity involved in referring to A 
as the cosmological constant (and a corresponding ambiguity as to what counts 
as a solution to Einstein's equation). We can take (M, g a b,T a b) to qualify if it 
satisfies the equation for some value (or other) of A. Or, more stringently, one 
can take it to qualify if it satisfies the equation for some value of A that is fixed, 

27 He did so for other reasons as well (see Earman I20U1I ). but I will pass over them here. 
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once and for all, i.e., the same for all models (M, g a b, T a b). In effect, we have 
here two versions of "relativity theory" . (See Earman |2()()3| for discussion of 
what is at stake in choosing between the two.) 

(4) It is instructive to consider the relation of Einstein's equation to Poisson's 
equation, the field equation of Newtonian gravitation theory: 

V 2 = 4tt ( o. (21) 

Here (j) is the Newtonian gravitational potential, and p is the Newtonian mass 
density function. In the "geometrized" formulation of the theory that we will 
consider in section T3. 21 one trades in the potential <fi in favor of a curved deriva- 
tive operator, and Poisson's equation comes out as 

R ab = Airptab, (22) 

where R a b is the Ricci tensor field associated with the new curved derivative 
operator, and t a b is the temporal metric. 

The geometrized formulation of Newtonian gravitation was discovered after 
general relativity (in the 1920s). But now, after the fact, we can put ourselves in 
the position of a hypothetical investigator who is considering possible candidates 
for a relativistic field equation, and knows about the geometrized formulation of 
Newtonian theory. What could be more natural than the attempt to adopt or 
adapt In the empty space case (p = 0), this strategy suggests the equation 

R a b = 0, which is, of course, Einstein's equation l|19|) for T a & = and A = 0. 
This seems to me, by far, the best route to the latter equation. Start with the 
Newtonian empty space equation (R a b = 0) and then simply leave it intact! 

No such simple extrapolation is possible in the general case (p ^ 0). Indeed, 
I know of no heuristic argument for the full version of Einstein's equation (with 
or without cosmological constant) that is nearly so convincing. But one can 
try something like the following. The closest counterparts to (|22|l would seem 
to be ones of the form: R a b = 4^K ab , where K a b is a symmetric tensorial 
function of T a b and g a b- The possibilities for K a b include T a b, g a bT, T a m T m b, 
9ab (T mn T mn ), and linear combinations of these terms. All but the first two 
involve terms that are second order or higher in T a b- So, for example, in the 
special case of a dust field with energy density p and four- velocity r] a , they will 
contain occurrences of p n with n > 2. (E.g., g a b (T mn T mn ) comes out as p 2 g a b-) 
But, presumably, only terms first order in p should appear if the equation is to 
have a proper Newtonian limit. This suggests that we look for a field equation 
of the form 

R ab =4Tr[kT ab + lg ab T} (23) 
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or, equivalcntly. 



2N 



Rab 



I 



Rg a b = 4:irkT ab , 



(24) 



(fc + 4/) 

for some real numbers k and I. Let G a b(k, I) be the field on the left side of the 
equation. It follows from the conservation condition that the field on the right 
side is divergence free, i.e., V a (A tt k T ab ) = 0. So the conservation condition 
and (|24|l can hold jointly only if 

V a G ab (k,l) = 0. 

But by the "Bianchi identity" (Wald pgl pp. 39-40]), 

1 



V a (R 

The latter two conditions imply 

I 
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Rg ab ) 
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Now V a (Rg ab ) = is an unreasonable constraint. 29 So the initial scalar term 
must be 0. Thus, we are left with the conclusion that the conservation condition 
and (|24|l can hold jointly only if k + 21 = 0, in which case l|23|l reduces to 



R ab = 4 it k 



Tab — — 9ab T 



(26) 



It remains to argue that k must be 2 if 12tjfl is to have a proper Newtonian 
limit. To do so, we consider, once again, the special case of a dust field with 
energy density p and four- velocity if. Then, T a h = prj a rn,, and T = p. If wc 
insert these values in l|26() and contract with rf rj b , we arrive at 



R a bV a V = ^TTkp. 



(27) 



Now the counterpart to a four-velocity field in Newtonian theory is a vector 
field of unit temporal length, i.e., a field rf where t a b rf rf = 1. If we contract 
the geometrized version of Poisson's equation IL'l'l) with vf rj b , we arrive at: 

28 Contraction on 'a' and 'b' in 1231 yields: R = iir (fc+4i) T. Solving for T, and substituting 
for T in yields I2H . 

29 It implies that R is constant and, hence, if 1231 holds, that T is constant (since 1231 
implies R = 4iz(k + 41) T). But this, in turn, is an unreasonable constraint on the energy- 
momentum distribution T ab . E.g., in the case of a dust field with T a (, = prj a r) h , T = p, and so 
the constraint implies that p is constant. This is unreasonable since it rules out any possibility 
of cosmic expansion. (Recall the discussion toward the close of section f2.4l 'l 
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RabV a 7 7 fc — 47rp. Comparing this expression for R a bV a V b with that in (|27() . 
we are led to the conclusion that k = 2, in which case l|26|) is just Einstein's 
equation (|19|l with A = 0. 

Summarizing now, we have suggested that if one starts with the geometrized 
version of Poisson's equation l|22|l and looks for a relativistic counterpart, one is 
plausibly led to Einstein's equation with A = 0. It is worth noting that if we had 
started instead with a variant of (|22|l incorporating a "Newtonian cosmological 
constant" 

Rab + Xt ab = iTrpt ab , (28) 

we would have been led instead to Einstein's equation (|TU|l without restriction 
on A. We can think of l|28|l as the geometrized version of 

V 2 ^ + A = 47rp. (29) 

Let's now put aside the question of how one might try to motivate Ein- 
stein's equation. However one arrives at it, the equation - let's now take it in 
the form l|18|) - can be understood to assert a dynamical connection between 
a certain tensorial measure of spacetime curvature (on the left side) and the 
energy- momentum tensor field (on the right side). It turns out that one can 
reformulate the connection in a way that makes reference only to scalar quanti- 
ties, as determined relative to arbitrary observers. The reformulation provides 
a certain insight into the geometric significance of the equation. 30 

Let S be any smooth spacelike hypersurface in M 31 The background metric 
g a b induces a (three-dimensional) metric 3 g a b on S. In turn, this metric deter- 
mines on S a derivative operator, an associated Riemann curvature tensor field 
3 R a bcd , and a scalar curvature field 3 R = ( 3 R a bca )( 3 g bc )- Our reformulation of 
Einstein's equation will direct attention to the values of 3 R at a point for a 
particular family of spacelike hypersurfaces passing through it. 32 

Let p be any point in M and let £ a be any future-directed unit timclikc 
vector at p. Consider the set of all geodesies through p that are orthogonal to 

30 Another approach to its geometrical significance proceeds via the equation of geodesic 
deviation. See, for example, Sachs and Wu I1977bl p. 114]. 

31 We can take this to mean that S is a smooth, imbedded, three-dimensional submanifold 
of M with the property that any curve 7 : / — > M with image in S is spacelike. 

32 In the case of a surface in three-dimensional Euclidean space, the associated Riemann 
scalar curvature 2 R is (up to a constant) just ordinary Gaussian surface curvature. We can 
think of 3 i? in the present context as a higher dimensional analogue that gives averaged values 
of Gaussian surface curvature. This can be made precise. See, for example, Laugwitz 119651 
p. 127]. 
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£ a there. The (images of these) curves, at least when restricted to a sufficiently 
small open set containing p, sweep out a smooth spacelike hypersurface S. 33 
(See figure l2~5.fi ) We will call it a geodesic hypersurface. (We cannot speak 




Figure 2.5.f : A "geodesic hypersurface" through a point is 
constructed by projecting geodesies in all directions orthog- 
onal to a given timelike vector there. 

of the geodesic hypersurface through p orthogonal to £ a because we have left 
open how far the generating geodesies are extended. But given any two, their 
restrictions to a suitably small open set containing p coincide.) 

Geodesic hypersurfaces are of interest in their own right, the present context 
aside, because they are natural candidates for a notion of "local simultaneity 
slice" (relative to a timelike vector at a point). What matters here, though, is 
that, by the first Gauss-Codazzi equation (Wald 1984, p. 258]), we have 

3 R = R -2i? Qb C£ b (30) 

at p. 34 Here we have expressed the (three-dimensional) Riemann scalar curva- 
ture of S at p in terms of the (four-dimensional) Riemann scalar curvature of 
M at p and the Ricci tensor there. And so, if Einstein's equation l|18|) holds, we 
have 

3 R = -16tt (T ab T^) + 2A. (31) 

at p. 

33 More precisely, let S p be the spacelike hyperplane in M p orthogonal to £ a . Then for any 
sufficiently small open set O in M p containing p, the image of (S p n O) under the exponential 
map exp : O — > M is a smooth spacelike hypersurface. We can take it to be S. (See, for 
example, Hawking and Ellis 119721 p. 33].) 

34 Let £ a — we use the same notation - be the extension of the original vector at p to a 
smooth future-directed unit timelike vector field on S that is everywhere orthogonal to S. 
Then the first Gauss-Codazzi equation asserts that at all points of S 

3 R = R- 2 R ab et b + ^ab h ab + 77 ab TT a >>, 

where h ab is the spatial projection field (g ab — £, a £,b) on S, and 7T (, is the extrinsic curvature 
field g £^h ab on S. But our construction guarantees that -K ab vanish at p. 
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One can also easily work backwards to recover Einstein's equation at p from 
the assumption that (|31|1 holds for all unit timelike vectors £ a at p (and all 
geodesic hypersurfaces through p orthogonal to £ a ). Thus, we have the following 
equivalence. 

Proposition 2.5.1. Let T a b be a smooth symmetric field on M , and let p be 

a point in M . Then Einstein's equation R a b — \ Rg a b + ^9ab = 8 7rT a b holds 
at p iff for all future- directed unit timelike vectors £ a at p, and all geodesic 
hypersurfaces through p orthogonal to £f , the scalar curvature 3 R of S satisfies 
3 i?= [-16n(T ab £ a £ b ) + 2\] a t p . 

The result is particularly instructive in the case where A = 0. Then (|3 1 11 
directly equates an intuitive scalar measure of spatial curvature (as determined 
relative to £ a ) with energy density (as determined relative to £ a ). 

2.6 Congruences of Timelike Curves and "Public Space" 

In this section, we consider congruences of timelike curves. We think of them 
as representing swarms of particles (or fluids). First, we review the standard 
formalism for describing the local rotation and expansion of such congruences. 
Then, we consider different notions of "space" and "spatial geometry" as deter- 
mined relative to them. 

Once again, let (M, g a b) be our background relativistic spacetime (endowed 
with a temporal orientation). Let £ a be a smooth, future-directed, unit timclikc 
vector field on M (or some open subset thereof). We understand it to be the 
four-velocity field of our particle swarm. 

Let h a b be the spatial projection field determined by £ a . Then the rotation 
field Uab and the expansion field 8 a b are defined as follows: 

"ab = h [a m h b] n V m £ n (32) 

Q ah = h (a m h b) n V m Zn. (33) 

They are smooth fields, orthogonal to £ a in both indices, and satisfy 

V o & = w a6 + o6 + £„(rV n &). (34) 

We can give the two fields uj a b and 9 a b a geometric interpretation. Let rf be 
a vector field on the worldlinc of a particle O that is "carried along by the flow 
of £ a ", i.e., £^ rf = 0, and is orthogonal to £ a at a point p. (Here £(. rf is the 
Lie derivative of rf with respect to £ a .) We can think of rf at p as a spatial 
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"connecting vector" that spans the distance between O and a neighboring par- 
ticle N that is "infinitesimally close" . The instantaneous velocity of N relative 
to O at p is given by V„ rj a - But £™ V„ r? a = rf 1 V„ £ a (since £ ? ?/ a = 0). So, 
by QH4JI. and the orthogonality of £ a with rj a at p, we have 

eV n r7 a = « + 0<- (35) 

at the point. Here we have simply decomposed the relative velocity vector into 
two components. The first, (ui n a r) n ), is orthogonal to rf (since u! a b is anti- 
symmetric). It gives the instantaneous rotational velocity of N with respect to 
O at p. 

In support of this interpretation, consider the instantaneous rate of change 
of the squared length (— rf rj a ) of if at p. It follows from (|35|l that 

eV n (-V a Va) = -2dnaV n V a - (36) 

Thus the computed rate of change depends solely on 9 a b- Suppose 9 a b = 0. 
Then the instantaneous velocity of N with respect to O at p has vanishing 
radial component. If Lo a -b ^ 0, N still exhibits a non-zero velocity with respect 
to O. But it can only be a rotational velocity. The two conditions (0 a b = and 
w„t 7^ 0) jointly characterize "rigid rotation". 

The condition Lu a b — 0, by itself, characterizes irrotational flow. One gains 
considerable insight into the condition by considering a second, equivalent for- 
mulation. Let us say that the field £ a is hypersurface orthogonal if there exist 
smooth, real valued maps / and g (with the same domains of definition as £ a ) 
such that, at all points, £ a = f\7 a g. Note that if the condition is satisfied, 
then the hypersurfaces of constant g value are everywhere orthogonal to £ a . 35 
Let us further say that £ a is locally hypersurface orthogonal if the restriction of 
£ a to every sufficiently small open set is hypersurface orthogonal. 

Proposition 2.6.1. Let £ a be a smooth, future-directed unit timelike vector 
field defined on M (or some open subset of M). Then the following conditions 
are equivalent. 

(1) Lo a b = everywhere. 

(2) £ a is locally hypersurface orthogonal. 

35 For if r) a is a vector tangent to one of these hypersurfaces, »7 n V n g = 0. So rj n ^ n = 
?? n (/V«g) = 0. 
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The implication from (2) to (1) is immediate. 36 But the converse is non- 
trivial. It is a special case of Frobenius's theorem (Wald 1 !)S 1 p. 436]). The 
qualification 'locally' can be dropped in (2) if the domain of £ a is, for example, 
simply connected. 

There is a nice picture that goes with the proposition. Think about an ordi- 
nary rope. In its natural twisted state, the rope cannot be sliced by an infinite 
family of slices in such a way that each slice is orthogonal to all fibers. But if the 
rope is first untwisted, such a slicing is possible. Thus orthogonal sliceability is 
equivalent to fiber untwistedness. The proposition extends this intuitive equiva- 
lence to the four-dimensional "spacetime ropes" (i.e., congruences of worldlincs) 
encountered in relativity theory. It asserts that a congruence is irrotational (i.e., 
exhibits no twistedness) iff it is, at least locally, hypersurface orthogonal 

Suppose that our vector field £ a is irrotational and, to keep things simple, 
suppose that its domain of definition is simply connected. Then the hypersur- 
faces to which it is orthogonal are natural candidates for constituting "space" 
at a given "time" relative to £ a or, equivalently, relative to its associated set of 
integral curves. This is a notion of public space to be contrasted with private 
space, which is determined relative to individual timelike vectors or timelike 
curves. 37 Perhaps the best candidates for the latter are the "geodesic hypersur- 
faces" we considered, in passing, in section T2. 51 (Given a point p and a timelike 
vector £ a there, we took a "geodesic hypersurface through p orthogonal to £°" 
to be a spacelike hypersurface generated by geodesies through p orthogonal to 

The distinction between public and private space is illustrated in Figure 
12.6.11 There we consider a congruence of future-directed timelike half-geodesics 
in Minkowski spacetime starting at some particular point p. One line O in the 
congruence is picked out along with a point q on it. Private space relative to O 
at q is a spacelike hypersurface S pr i va t e that is flat, i.e., the metric induced on 
Sprivate has a Riemann curvature tensor field 3 R a {, cc [ that vanishes everywhere. 

36 Assume that f a = / V a 9- Then 

<"«b = h [a m h b] n \> m Z n =h [c rh b] "V m (fV n g) 

= / v" v Vm v ™ 9 + \r v ( Vm f) ( v " 9) 

= fh a m h b "V [m V n] g + h a m h b n ( V [m /) (V n] g). 

But V[ m V n ] g = since V a is torsion-free, and the second term in the final line vanishes as 
well since h b n V„ g = f~ x h b n £ n = 0. So uj ab = 0. 

37 The distinction between "public space" and "private space" is discussed in Rindler 119811 
and Page 119831 . The terminology is due to E. A. Milne. 
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Figure 2.6.1: "Private space" Sprivate at P relative to 0, 
and "public space" S pu bu c at p relative to a congruence of 
timelike curves of which is a member. 

In contrast, public space at q relative to the congruence is a spacelike hypersur- 
face Sp U biic of constant negative curvature. If £ a is the future-directed unit time- 
like vector field everywhere tangent to the congruence, and h a b — (g a b — £ £&) is 
its associated spatial projection field, then the curvature tensor field on S pu uic 
associated with h a b has the form 3 R a bcd = ~-£^{h a c.hbd — hadhbc), where K 
is the distance along O from p to q. (This is the characteristic form for a 
three- manifold of constant curvature —f^r-) 

We have been considering "public space" as determined relative to an irrota- 
tional congruence of timelike curves. There is another sense in which one might 
want to use the term. Consider, for example, "geometry on the surface of a 
rigidly rotating disk" in Minkowski spacetime. (There is good evidence that 
Einstein's realization that this geometry is non-Euclidean played an important 
role in his development of relativity theory (Stachel 1980 ).) One needs to ask 
in what sense the surface of a rotating disk has a geometric structure. 

We can certainly model the rigidly rotating disk as a congruence of time- 
like curves in Minkowski spacetime. (Since the disk is two-dimensional, the 
congruence will be confined to a three-dimensional, timelike submanifold M' 
of M .) But precisely because the disk is rotating, we cannot find hypersur- 
faces everywhere orthogonal to the curves and understand the geometry of the 
disk to be the geometry induced on them - or, strictly speaking, induced on 
the two-dimensional manifolds determined by the intersection of the putative 
hypersurfaces with M' - by the background spacetime metric g a b- 

The alternative is to think of "space" as constituted by the "manifold of 
trajectories", i.e., take the individual timelike curves in the congruence to play 
the role of spatial points, and consider the metric induced on this manifold by the 
background spacetime metric. The construction will not work for an arbitrary 
congruence of timelike curves. It is essential that we are dealing here with a 
"stationary" system. (The metric induced on the manifold of trajectories (when 
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the construction works) is fixed and frozen.) But it does work for these systems, 
at least. More precisely, anticipating the terminology of the following section, 
it works if the four-velocity field of the congruence in question is proportional 
to a Killing field. (The construction is presented in detail in Geroch [19711 
Appendix].) 

Thus we have two notions of "public space" . One is available if the four- 
velocity field of the congruence in question is irrotational; the other if it is 
proportional to a Killing field. Furthermore, if the four-velocity field is irrota- 
tional and proportional to Killing field, as is the case when we dealing with a 
"static" system, then the two notions of public space are essentially equivalent. 

2.7 Killing Fields and Conserved Quantities 

Let n a be a smooth vector field on M. We say it is a Killing field if £ K g a b = 0, 
i.e., if the Lie derivative with respect to k" of the metric vanishes. 38 This is 
equivalent to the requirement that the "flow maps" {F s } generated by K a are 
all isometries. (See Wald (19841 p. 441].) For this reason, Killing fields are 
sometimes called "infinitesimal generators of smooth one-parameter families of 
isometries" or "infinitesimal symmetries" . The defining condition can also be 
expressed as 39 

W (a K b) = 0. (37) 

This is "Killing's equation" . 

Given any two smooth vector fields £ a and /i a on M, the bracket or commu- 
tator field [£, /i] a defined by [£, fj] a = £^ /i a is also smooth. The set of smooth 
vector fields on M forms a Lie algebra with respect to this operation, i.e., the 
bracket operation is linear in each slot; it is anti-symmetric ([£, /i] a = — [/J.,<S;] a ); 
and it satisfies the Jacobi identity 

[[&/*]> v \ a + + [Mt]° = (38) 

for all smooth vector fields £°, fi a , and v a on M, It turns out that the bracket 
field of two Killing fields is also a Killing field. So it follows, as well, that the 
set of Killing fields on M has a natural Lie algebra structure. 

38 We drop the index on ft here to avoid giving the impression that £ K g a ^ is a three index 
tensor field. Lie derivatives are always taken with respect to (contravariant) vector fields, 
so no ambiguity is introduced when the index is dropped. A similar remark applies to our 
bracket notation below. 

39 This follows since £ K g a i, = K n V n g a b + 9ubV a K n + (?anV(, n n , and V a is compatible with 
9ab, i-e., V„s a6 = 0. 
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The discussion of smooth symmetries in spacetime, and their associated con- 
served quantities, is naturally cast in the language of Killing fields. For example, 
we can use that language to capture precisely the following intuitive notions. A 
spacetime is stationary if it has a Killing field that is everywhere timelikc. It 
is axially symmetric if it has a Killing field that is everywhere spacelike, and 
has integral curves that are closed. (The "axis" in this case is the set of points, 
possibly empty, where the Killing field vanishes.) Finally, a spacetime is spher- 

12 3 

ically symmetric if it has three Killing fields cr a ,a a ,a a that (i) are everywhere 
spacelike, (ii) are linearly dependent at every point, i.e., a^ a a b a c ^ = 0, and (iii) 
exhibit the same commutation relations as do the generators of the rotation 
group in three dimensions: 

[a , a] a = a a , [a, a] a = a a , [a, a] a = l a . (39) 

Now we consider, very briefly, two types of conserved quantity. One is an 
attribute of massive point particles, the other of extended bodies. Let n a be an 
arbitrary Killing field, and let 7 : I — *■ M be a timelike curve, with unit tangent 
field £ a , whose image is the worldline of a point particle with mass m > 0. 
Consider the quantity J = (P a K a ), where P a = m£ a is the four-momentum of 
the particle. It certainly need not be constant on 7[/]. But it will be if 7 is a 
geodesic. For in that case, £ n V„ £ a = and hence, by (|3TJl . 

rv„ j - m (« a £»v„ r + ce v„ «„) - ™cr v (n « a) = o. (40) 

Thus, the value of J (construed as an attribute of massive point particles) is 
constant for free particles. 

We refer to J as the conserved quantity associated with K a . If n a is timelike, 
and if the flow maps determined by n a have the character of translations 40 , then 
J is called the energy of the particle (associated with K a ). 41 If it is spacelike, 

40 In Minkowski spacetime, one has an unambiguous classification of Killing fields as gen- 
erators of translations, spatial rotations, boosts (and linear combinations of them). No such 
classification is available in general. Killing fields are just Killing fields. But sometimes a 
Killing field in a curved spacetime resembles a Killing field in Minkowski spacetime in certain 
respects, and then the terminology may carry over naturally. For example, in the case of 
asymptotically flat spacetimes, one can classify Killing fields by their asymptotic behavior. 

41 If K a is of unit length everywhere, this usage accords well with that in section 12.31 For 
there ascriptions of energy to point particles were made relative to unit timelike vectors, and 
the value of the energy at any point was taken to be the inner product of that unit timelike 
vector with the particle's four-momentum vector. If K a is, at least, of constant length, then 
one can always rescale it so as to achieve agreement of usage. But, in general, Killing fields, 
timelike or otherwise, are not of constant length, and so the current usage must be regarded 
as a generalization of that earlier usage. 
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and if the flow maps have the character of translations, then J is called the 
component of linear momentum of the particle (associated with n a ). Finally, 
if n a is spacelike, and if the flow maps have the character of rotations, then it 
is called the component of angular momentum of the particle (associated with 
n a ). 

It is useful to keep in mind a certain picture that helps one to "see" why 
the angular momentum of free particles (to take that example) is conserved. It 
involves an analogue of angular momentum in Euclidean plane geometry. Figure 
12. 7. II shows a rotational Killing field K a in the Euclidean plane, the image of a 




Figure 2.7.1: k° is a rotational Killing field. (It is every- 
where orthogonal to a circle radius, and proportional to it 
in length.) £ a is a tangent vector field of constant length 
on the line. The inner-product between them is constant. 
(Equivalcntly, the length of the projection of K a onto the 
line is constant.) 

geodesic (i.e., a line L), and the tangent field £ a to the geodesic. Consider the 
quantity J = £ a K a , i-e., the inner product of £ a with K a , along L. Exactly the 
same proof as before (in equation (|40|l ) shows that J is constant along L. 42 But 
here we can better visualize the assertion. 

Let us temporarily drop indices and write n ■ £ as one would in ordinary 
Euclidean vector calculus (rather than £ a K a )- Let p be the point on L that 
is closest to the center point where k vanishes. At that point, n is parallel to 
£. As one moves away from p along L, in either direction, the length \k\ of 
k grows, but the angle Z(k,£) between the vectors increases as well. It is at 

42 The mass m played no special role. 
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least plausible from the picture (and easy to check directly with an argument 
involving similar triangles) that the length of the projection of k onto the line is 
constant. Equivalently, the inner product k-£ = cos(Z(k, £)) ||k|| is constant. 

That is how to think about the conservation of angular momentum for free 
particles in relativity theory. It does not matter that in the latter context we are 
dealing with a Lorentzian metric and allowing for curvature. The claim is still 
that a certain inner product of vector fields remains constant along a geodesic, 
and we can still think of that constancy as arising from a compensatory balance 
of two factors. 

Let us now turn to the second type of conserved quantity, the one that is 
an attribute of extended bodies. Let n a be an arbitrary Killing field, and let 
T a b be the energy-momentum field associated with some matter field. Assume 
it satisfies the conservation condition. Then (T ab K b ) is divergence free: 

V a (T ab Kb ) = K b V a T ab + T ab V a n b = T ab V {a n b) = 0. (41) 

(The second equality follows from the conservation condition for T ab (in section 
12. 4[) and the symmetry of T ab ; the third from the fact that n a is a Killing field.) 
It is natural, then, to apply Stokes' theorem to the vector field (T ab K b ). 

Consider a bounded system with aggregate energy-momentum field T ab in an 
otherwise empty universe. Then there exists a (possibly huge) timelike world 
tube such that T ab vanishes outside the tube (and vanishes on its boundary). 
Let Si and 5*2 be (non-intersecting) spacelike hypersurfaces that cut the tube 




Figure 2.7.2: The integrated energy (relative to a back- 
ground timelike Killing field) over the intersection of the 
world tube with a spacelike hypersurface is independent of 
the choice of hypersurface. 

as in figure l2~7. 21 and let N be the segment of the tube falling between them 
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(with boundaries included). By Stokes' theorem, 

/ (T ab K b )dS a - f (T ab Kb )dS a 
JS 2 JSi 

= I {T ab K b )dS a - f {T ab n b )dS a 

JS 2 ndN JSindN 

= f {T ab Kb )dS a = f V a {T ab Kb )dV = 0. 

J ON JN 

Thus, the integral f s (T ab K b )dS a is independent of the choice of spacelike 
hypersurface S intersecting the world tube, and is, in this sense, a conserved 
quantity (construed as an attribute of the system confined to the tube). An 
"early" intersection yields the same value as a "late" one. Again, the character 
of the background Killing field k° determines our description of the conserved 
quantity in question. If K a is timelike, we take J s (T ab K b ) dS a to be the aggre- 
gate energy of the system (associated with K a ). And so forth. 

For further discussion of symmetry and conservation principles in general 
relativity, see Brading and Castellani (this volume, chapter 13). 

3 Special Topics 

3.1 Relative Simultaneity in Minkowski Spacetime 

We noted in section IP1 when discussing the decomposition of vectors at a point 
into their "temporal" and "spatial" components relative to a four- velocity vector 
there, that we were taking for granted the standard identification of relative 
simultaneity with orthogonality. Here we return to consider the justification of 
that identification. 

Rather than continue to cast the discussion as one concerning the decom- 
position of the tangent space at a particular point, it is convenient to construe 
it instead as one about the structure of Minkowski spacetime, the regime of 
so-called "special relativity" . Doing so will bring it closer to the framework in 
which traditional discussions of the status of the relative simultaneity relation 
have been conducted. 

Minkowski spacetime is a relativistic spacetime (M, g ab ) characterized by 
three conditions: (i) M is the manifold E 4 ; (ii) (M,g ab ) is flat, i.e., g ab has 
vanishing Riemann curvature everywhere; and (iii) (M, g ab ) is geodesically com- 
plete, i.e., every geodesic (with respect to g ab ) can be extended to arbitrarily 
large parameter values in both directions. 
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By virtue of these conditions, Minkowski spacetime can be canonically iden- 
tified with its tangent space at any point, and so it inherits the structure of a 
"metric affine space" in the following sense. Pick any point o in M, and let V 
be the tangent space M at o. Then there is a map (p, q) i— ► pq from M x M 
to V with the following two properties. 

(1) For all p, q and r in M, pq + qr = pr. 

(2) For all p in M, the induced map q^pq from M to V is a bijection. 43 

The triple consisting of the point set M, the vector space V, and the map 
(j), q) I— > pq forms an affine space. If we add to this triple the inner product on 
V defined by g a b it becomes a (Lorentzian) metric affine space. (For convenience 
we will temporarily drop the index notation and write (v, w) instead of g a bV a w b 
for v and w in V.) We take all this structure for granted in what follows, i.e., 
we work with Minkowski spacetime and construe it as a metric affine space in 
the sense described. This will simplify the presentation considerably. 

We also use an obvious notation for orthogonality. Given four points p, q, r, s 
in M, we write pq _L rs if (pq, rs) = 0. And given a line 44 O in M, we write 
pq _L O if pq _L rs for all points r and sonO. 

Now consider a timelike line O in M. What pairs of points (p, q) in M 
should qualify as being "simultaneous relative to O"? That is the question we 
are considering. The standard answer is that they should do so precisely if 
pq _L O. 

In traditional discussions of relative simultaneity, the standard answer is 
often cast in terms of "epsilon" values. The connection is easy to see. Let p be 
any point that is not on our timelike line O. Then there exist unique points r 
and s on O (distinct from one another) such that rp and ps are future-directed 
null vectors. (See figure l3~l.ll ) Now let q be any point on O. (We think of 
it as a candidate for being judged simultaneous with p relative to O.) Then 

43 If exp is the exponential map from M to M, we can take pq to be the vector 

{exp~ 1 (q) - exp~ 1 (p)) 

in M a . All other standard properties of affine spaces follow from these two. E.g., it follows 
that pq = p = q, for all p and q in M. (Here is the zero vector in V.) 

44 In the present context we can characterize a line in more than one way. We can take 
it to be the image of a maximally extended geodesic that is non-trivial, i.e., not a point. 
Equivalcntly, we can take it to be a set of points of the form {r : pr = e pq for some e in R} 
where p and q are any two (distinct) points in M. 
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Figure 3.1.1: The e = | characterization of relative simul- 
taneity: p and q are simultaneous relative to O iff q is 
midway between r and s. 



= e rs for some e £ R. A simple computation 45 shows that 

e = - re _L fs. (42) 

So the standard (orthogonality) relation of relative simultaneity in special rel- 
ativity may equally well be described as the "e = i" relation of relative simul- 
taneity. 

Yet another equivalent formulation involves the "one- way speed of light" . 
Suppose a light ray travels from r to p with speed c + relative to O, and from p 
to s with speed c_ relative to O. We saw in section |2~31 that z/ one adopts the 
standard criterion of relative simultaneity, then it follows that c + = c_. The 
converse is true as well. For if c + = c_, then, as determined relative to O, it 
should take as much time for light to travel from r to p as from p to s. And 
in that case, a point q on O should be judged simultaneous with p relative to 
O precisely if it is midway between r and s. So we are led, once again, to the 
"e = i" relation of relative simultaneity. 

Now is adoption of the standard relation a matter of convention, or is it in 
some significant sense forced on us? 

45 First note that, since ps and pr are null, 

= {ps,ps) = (pr + rs, pr + rs) = 2(pr, rs) + (rs, rs). 

It follows that 

(pq, rs) = (pr + rq, rs) = (pr + ers, rs) = (pr, rs) + e(rs, rs) = (e — — )(rs, rs), 
which implies 1421 . 
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There is, of course, a large literature devoted to this question. 6 It is not my 
purpose to review it here, but I do want to draw attention to certain remarks 
of Howard Stein [19911 pp. 153-4] that seem to me particularly insightful. He 
makes the point that determinations of conventionality require a context. 

There are really two distinct aspects to the issue of the "convention- 
ality" of Einstein's concept of relative simultaneity. One may assume 
the position of Einstein himself at the outset of his investigation - 
that is, of one confronted by a problem, trying to find a theory that 
will deal with it satisfactorily; or one may assume the position of 
(for instance) Minkowski - that is, of one confronted with a theory 
already developed, trying to find its most adequate and instructive 
formulation. 

The problem Einstein confronted was (in part) that of trying to account for 
our apparent inability to detect any motion of the earth with respect to the 
"aether" . A crucial element of his solution was the proposal that we think 
about simultaneity a certain way (i.e., in terms of the "e = h criterion"), and 
resolutely follow through on the consequences of doing so. Stein emphasizes 
just how different that proposal looks when we consider it, not from Einstein's 
initial position, but rather from the vantage point of the finished theory, i.e., 
relativity theory conceived as an account of invariant spacetime structure. 

[For] Einstein, the question (much discussed since Rcichcnbach) 
whether the evidence really shows that that the speed of light must 
be regarded as the same in all directions and for all observers is not 
altogether appropriate. A person devising a theory does not have 
the responsibility, at the outset, of showing that the theory being 
developed is the only possible one given the evidence. [But] once 
Einstein's theory had been developed, and had proved successful in 
dealing with all relevant phenomena, the case was quite transformed; 
for we know that within this theory, there is only one "reasonable" 
concept of simultaneity (and in terms of that concept, the velocity 
of light is indeed as Einstein supposed) ; therefore an alternative will 
only present itself if someone succeeds in constructing, not simply a 

46 Classic statements of the conventionalist position can be found in Reichenbach 119581 and 
Griinbaum |1973l . Griinbaum has recently responded to criticism of his views in |forthcoming| . 
An overview of the debate with many references can be found in Janis I2UU^I . 
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different empirical criterion of simultaneity, but an essentially differ- 
ent (and yet viable) theory of electrodynamics of systems in motion. 
No serious alternative theory is in fact known, (emphasis in original) 

My goal in the remainder of this section is to formulate three elementary 
uniqueness results, closely related to one another, that capture the sense in 
which "there is only one 'reasonable' concept of (relative) simultaneity" within 
the framework of Minkowski spacetime. 

It will help to first consider an analogy. In some formulations of Euclidean 
plane geometry, the relation of congruence between angles is taken as primi- 
tive along with that of congruence between line segments (and other relations 
suitable for formulating axioms about affine structure). But suppose we have 
a formulation in which it is not, and we undertake to define a notion of angle- 
congruence in terms of the other primitives. The standard angle-congruence 
relation can certainly be defined this way, and there is a clear sense in which 
it is the only reasonable candidate. Consider any two angles in the Euclidean 
plane. (Let's agree that an "angle" consists of two rays, i.e., half- lines, with a 
common initial point.) Whatever else is the case, presumably, it is only reason- 
able to count them as congruent, i.e., equal in "size", if there is an isometry of 
the Euclidean plane that maps one angle onto the other. 47 So though we have 
here a notion of angle-congruence that is introduced "by definition" , there is no 
interesting sense in which it is conventional in character. 

A situation very much like this arises if we think about "one-way light speeds" 
in terms of Minkowskian spacetime geometry. Indeed, the claim that the speed 
of light in vacuo is the same in all directions and for all inertial observers is 
naturally represented as a claim about angle congruence (for a special type of 
angle) in Minkowski spacetime. 

Let us take a "light-speed angle" to be a triple of the form (p,T,N), where 
p is a point in M, T is a future-pointing timelike ray with initial point p, and 
TV is a future-pointing null ray with initial point p. (See figure IXl^l l Then 
we can represent systematic attributions of one-way light speed as maps of the 
form: (p,T,N) i— > v(p,T,N). (We understand v(p,T,N) to be the speed that 
an observer with (half) worldline T at p assigns to the light signal with (half) 
worldline N.) So, for example, the principle that the speed of light is the same 
in all directions and for all inertial observers comes out as the condition that 
v(p, T, N) = v(p', T', N') for all light-speed angles (p, T, N) and (p', T', N'), 

47 In this context, a one-to-one map of the Euclidean plane onto itself is an "isometry" if it 
preserves the relation of congruence between line segments. 
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Figure 3.1.2: Congruent "light speed angles" in Minkowski 
spacetime. 



Now it is natural to regard v(p, T, N) as a measure of the "size" of the angle 
(p,T,N). If we do so, then, just as in the Euclidean case, we can look to the 
background metric to decide when two angles have the same size. That is, we 
can take them to be congruent iff there is an isometry of Minkowski spacetime 
that maps one to the other. But on this criterion, all light-speed angles are 
congruent (proposition ^. 1 . l"|) . So we are led back to the principle that the (one- 
way) speed of light is the same in all directions and for all inertial observers 
and, hence, back to the standard relative simultaneity relation. 

Proposition 3.1.1. Let (p,T,N) and (p',T',N') be any two light speed angles 
in Minkowski spacetime. Then there is an isometry cj> of Minkowski spacetime 
such that cj){p) = p', 4>[T] = T', and </>[N] = TV'. 48 

Once again, let O be a timelike line in M, and let Simo be the standard 
relation of simultaneity relative to O. (So (p, q) £ Simo iS pq ± O, for all 
p and q in M .) Further, let S be an arbitrary two-place relation on M that 
we regard as a candidate for the relation of "simultaneity relative to O" . Our 
second uniqueness result asserts that if S satisfies three conditions, including an 
invariance condition, then S — Simo- 

The first two conditions are straightforward. 

51 S is an equivalence relation (i.e., S is reflexive, symmetric, and transitive). 

52 For all points p £ M, there is a unique point q £ O such that (p, q) £ S. 

48 The required isometry can be realized in the form <f> = ^3 a ^2 ° 4>l where (i) (j>\ is a 
translation that takes p to p 1 , (ii) <jf>2 is a boost (based at p') that maps <j>i\T] to T", and (iii) 
4>s is a rotation about T 1 that maps (02 ° </>i)[iV] to N' . 

49 Many propositions of this form can be found in the literature. (See Budden 199S] for a 
review.) Ours is intended only as an example. There are lots of possibilities here depending 
on exactly how one formulates the conditions that S must satisfy. The proofs are all very 
much the same. 
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If S satisfies (SI), it has an associated family of equivalence classes. We can 
think of them as "simultaneity slices" (as determined relative to O). Then (S2) 
asserts that every simultaneity slice intersects O in exactly one point. 

The third condition is intended to capture the requirement that S is deter- 
mined by the background geometric structure of Minkowski spacetime and by 
O itself. But there is one subtle point here. It makes a difference whether tem- 
poral orientation counts as part of that background geometric structure or not. 
Let's assume for the moment that it does not. 

Let us say, quite generally, that S is implicitly definable in a structure of the 
form (M, ...) if it is invariant under all symmetries of (M, ...), i.e., for all such 
symmetries <fi : M — > M, and all points p and q in M, 

( P ,q)eS <=► (#p),0(g)) 6 5. (43) 

(Here, a symmetry of (M, ...) is understood to be a bijection cj> : M — > M 
that preserves all the structure in "...".) This is certainly a very weak sense of 
definability. 50 

We are presently interested in the structure (M, ( ), O). 51 Its symmetries 
are bijections <fr '■ M — > M such that, for all p, q, r, s in M, 

Wp) W)W)) = H rt) (44) 

and 

peO ^ <j>(p) € O. (45) 

They are generated by maps of the following three types: (i) translations ("up 
and down") in the direction of O, (ii) spatial rotations that leave fixed every 
point in O, and (iii) temporal reflections with respect to spacelike hyperplanes 
orthogonal to O. Our second uniqueness result can be formulated as follows. 52 

Proposition 3.1.2. Let S be a two-place relation on Minkowski spacetime that 
satisfies conditions (SI), (S2), and is implicitly definable in (M, ( ), O). Then 
S = Simo- 

50 We follow Budden 1998 in using "implicit definability in (M, ...)" as a convenient device 
for organizing various closely related uniqueness results. One moves from one to another 
simply by shifting the choice of . 

51 Strictly speaking, since we are here thinking of Minkowski spacetime as a metric affinc 
space, we should include elements of structure (to the right of M) that turn the point set M 
into an affine space, i.e., add a four-dimensional vector space V and a map from MxMto 
V satisfying conditions (1) and (2) listed at beginning of this section. But it is cumbersome 
to do so every time. Let them be understood in what follows. 

52 It is a close variant of one presented in Hogarth 1993 . 
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As it turns out, the full strength of the stated invariance condition is not 
needed here. It suffices to require that S be invariant under maps of type 
(iii). 53 

Suppose now that we do want to consider temporal orientation as part of the 
background structure that may play a role in the determination of S. Then the 
requirement of implicit definability in (M, ( ), O) should be replaced by that 
of implicit definability in (M, T, ( ), O), where T is the background temporal 
orientation. Maps of type (i) and (ii) qualify as symmetries of this enriched 
structure (too), but temporal reflections of type (iii) do not. Implicit definablity 
in (M, T, { ) , O) is a weaker condition than implicit definability in [M, ( ) , O) , 
and proposition 13 . 1 . 2l fails if the latter condition is replaced by the former. 

But we can still get a uniqueness result if we change the set-up slightly: 
namely, we think of O as merely one member of a congruence of parallel timelike 
lines To, the frame of O, and think of simultaneity as determined relative to 
the latter. Implicit definability in [M, T, ( ), To) is a stronger condition than 
implicit definability in (M, T, ( ), O) because there are symmetries of the former 
- (iv) translations taking O to other lines in To, and (v) spatial rotations that 
leave fixed the points of some line in To other than O - that are not symmetries 
of the latter. Our variant formulation of the uniqueness result is the following. 54 

Proposition 3.1.3. Let S be a two-place relation on Minkowski spacetime that 
satisfies conditions (SI), (S2), and is implicitly definable in (M, T, ( ), To)- 
Then S = Simo- 

The move from proposition ^. 1 ,21 to proposition ^ . 1 . 31 involves a trade-off. We 
drop the requirement that S be invariant under maps of type (iii), but add the 
requirement that it be invariant under those of type (iv) and (v). 55 

Once again, many variations of these results can be found in the literature. 
For example, if one subscribes to a "causal theory of time (or spacetime)" , one 

53 The key step in the proof is the following. Let p be a point in M. By (S2), there is a 
unique point q on O such that (p, q) £ S. Let <j> : M — » M be a reflection with respect to 
the hyperplane orthogonal to O that passes through p. Then <j>(p) = p, <t>{q) E O, and S 
is invariant under 4>. Hence (p,<fi(q)) = (o(p), ©(<?)) £ S. Since <j>(q) £ O, it follows by the 
uniqueness condition in (S2) that 4>(q) = q. But the only points left fixed by <j> are those on the 
hyperplane orthogonal to O that passes through p. So p and q are both on that hyperplane, 
and pq is orthogonal to O, i.e., {p,q) G Simo. 

54 It is closely related to propositions in Spirtes 2981;, Stein |1991l . and Budden I1998| . 

55 It is a good exercise to check that one does not need the full strength of the stated 
invariance condition to derive the conclusion. It suffices to require that S be invariant under 
maps of type (i) and (v), or, alternatively, invariant under maps of type (ii) and (iv). 
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will want to consider what candidate simultaneity relations are determined by 
the causal structure of Minkowski spacetime (in adddition to the line O). Let 
k be the symmetric two-place relation of "causal connectibility" in M, i.e., the 
relation that holds of two points p and q if pq is a causal vector. Clearly, 
every symmetry of (M, ( )) is a symmetry of (M, k). So the requirement of 
implicit definability in (M, k, O) is at least as strong as that of implicit defin- 
ability in (M, ( ) , O) . It follows that we can substitute the former for the latter 
in proposition 13.1.21 Similarly, we can substitute the requirement of implicit 
definability in (M, T, k, To) for that of implicit definability in (M, T, ( ), To) 
in proposition 13.1.31 

3.2 Geometrized Newtonian Gravitation Theory 

The "geometrized" formulation of Newtonian gravitation theory was first in- 
troduced by Cartan |1923l m?2"I] . and Friedrichs |1927| . and later developed by 
Trautman [TOd3| . Klinzle [HT721 IT375] . Ehlers QUO], and others. 

It is significant for several reasons. (1) It shows that several features of rela- 
tivity theory once thought to be uniquely characteristic of it do not distinguish 
it from (a suitably reformulated version of) Newtonian gravitation theory. The 
latter too can be cast as a "generally covariant" theory in which (a) gravity 
emerges as a manifestation of spacetime curvature, and (b) spacetime structure 
is "dynamical", i.e., participates in the unfolding of physics rather than being 
a fixed backdrop against which it unfolds. 

(2) It helps one to see where Einstein's equation "comes from", at least in 
the empty-space case. (Recall the discussion in section 1231 1 It also allows one 
to make precise, in coordinate-free, geometric language, the standard claim that 
Newtonian gravitation theory (or, at least, a certain generalized version of it) is 
the "classical limit" of general relativity. (See Kiinzle 1976 and Ehlers 1981 .) 

(3) It clarifies the gauge status of the Newtonian gravitational potential. In 

the geometrized formulation of Newtonian theory, one works with a single curved 

a 

derivative operator V a . It can be decomposed (in a sense) into two pieces - a flat 
derivative operator V a and a gravitational potential (f> - to recover the standard 
formulation of the theory. 56 But in the absence of special boundary conditions, 
the decomposition will not be unique. Physically, there is no unique way to 

56 As understood here, the "standard" formulation is not that found in undergraduate text- 
books, but rather a "generally covariant" theory of four-dimensional spacetime structure in 
which gravity is not geometrized. 
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divide into "inertial" and "gravitational" components the forces experienced by 
particles. Neither has any direct physical significance. Only their "sum" does. 
It is an attractive feature of the geometrized formulation that it trades in two 
gauge quantities for this sum. 

(4) The clarification described in (3) also leads to a solution, or dissolution, 
of an old conceptual problem about Newtonian gravitation theory, namely the 
apparent breakdown of the theory when applied (in cosmology) to a hypotheti- 
cally infinite, homogeneous mass distribution. (See Malament J1995 and Norton 

In what follows, we give a brief overview of the geometrized formulation of 
Newtonian gravitation theory, and say a bit more about points (1) and (3). 
We start by characterizing a new class of geometrical models for the spacetime 
structure of our universe (or subregions thereof) that is broad enough to include 
the models considered in both the standard and geometrized versions of New- 
tonian theory. We take a classical spacetime to be a structure (M,t a f ) ,h ab ,V a ) 
where (i) M is a smooth, connected, four-dimensional differentiable manifold; 
(ii) t a {, is a smooth, symmetric, covariant tensor field on M of signature (1, 0, 
0, 0); 57 (iii) h ab is a smooth, symmetric, contravariant tensor field on M of 
signature (0, 1, 1, 1); (iv) V a is a smooth derivative operator on M; and (v) the 
following two conditions are met: 

h ab t bc = (46) 
V i bc = = W a h bc . (47) 

We refer to them, respectively, as the "orthogonality" and "compatibility" con- 
ditions. 

M is interpreted as the manifold of point events (as before); t a b and h ab are 
understood to be temporal and spatial metrics on M, respectively; and V a is 

57 The stated signature condition is equivalent to the requirement that, at every point p in 

1 4 

M, the tangent space M p has a basis £ a , £ a such that, for all i and j in {1, 2, 3, 4}, 



1 if i = 1 

if i = 2, 3, 4 



and tao?"? 6 = if i ^ j. Similarly, the signature condition on h ab stated in (iii) requires 
that, at every point, the co-tangent space have a basis <r a , ...,<r a such that, for all i and j in 
{1,2,3,4}, 

uab < i _ j ® if i = 1 

[I if i = 2, 3, 4 

and h ab a a bb = if i ^ j. 
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understood to be an affine structure on M. Collectively, these objects represent 
the spacetime structure presupposed by classical, Galilean rclativistic dynamics. 
We review, briefly, how they do so. 

In what follows, let (M, t a b, h ab , V a ) be a fixed classical spacetime. 

Consider, first, t a b- Given any vector £ a at a point, it assigns a "temporal 
length" {t a b£ a £ b )^ > 0. The vector £ a is classified as timelike or spacelike 
depending on whether its temporal length is positive or zero. It follows from 
the signature of t a b that the subspace of spacelike vectors at any point is three- 
dimensional. It also follows from the signature that at every point there exists 
a covariant vector t a , unique up to sign, such that t ao = t a t - We say that 
the structure (M,t ao , h ab , V a ) is temporally orientable if there is a continuous 
(globally defined) vector field t a such that this decomposition holds at every 
point. Each such field t a (which, in fact, must be smooth because t a b is) is a 
temporal orientation. A timclike vector £ a qualifies as future- directed relative 
to t a if t a £ a > 0; otherwise it is past- directed. Let us assume in what follows 
that (M,t a b,h ab ,\7 a ) is temporally orientable and that a temporal orientation 
t a has been selected. 

From the compatibility condition, it follows that t a is closed, i.e., V[ a i&] = 0. 
So, at least locally, it must be exact, i.e., of form t a = V a t for some smooth 
function t. We call any such function a time function. If M has a suitable global 
structure, e.g., if it is simply connected, then a globally defined time function 
t : M — > K must exist. In this case, spacetime can be decomposed into a 
one-parameter family of global (t = constant) "time slices" . One can speak of 
"space" at a given "time". A different choice of time function would result in 
a different zero-point for the time scale, but would induce the same time slices 
and the same elapsed intervals between them. 

We say that a smooth curve is timelike (respectively spacelike) if its tangent 
field is timelike (respectively spacelike) at every point. In what follows, un- 
less indication is given to the contrary, it should further be understood that a 
"timelike curve" is future-directed and parametrized by its t a b - length. In this 
case, its tangent field £ a satisfies the normalization condition i £ a = 1. Also, 
in this case, if a particle happens to have the image of the curve as its world- 
line, then, at any point, £ a is called the particle's four-velocity, and £ ra V„ £ a its 
four- acceleration, there. 58 If the particle has mass m, then its four-acceleration 

58 Here we take for granted an interpretive principle that corresponds to CI: (i) a curve 
is timelike iff its image could be the worldlinc of a point particle. Other principles we can 
formulate at this stage correspond to PI and P2: (ii) a timelikc curve can be reparametrized 
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field satisfies the equation of motion 

f a =mf'V„f, (48) 

where F a is a spacelike vector field (on the image of its worldline) that represents 
the net force acting on the particle. This is, once again, our version of Newton's 
second law of motion. Recall (|15fl. Note that the equation makes geometric 
sense because four-acceleration vectors are necessarily spacelike. 59 

Now consider h ab . It serves as a spatial metric, but just how it does so is a bit 
tricky. In Galilean relativistic mechanics, we have no notion of spatial length for 
timelike vectors, e.g., four- velocity vectors, since having one is tantamount to a 
notion of absolute rest. (We can take a particle to be at rest if its four- velocity 
has spatial length everywhere.) But we do have a notion of spatial length 
for spacelike vectors, e.g., four-acceleration vectors. (We can, for example, use 
measuring rods to determine distances between simultaneous events.) h ab serves 
to give us one without the other. 

We cannot take the spatial length of a vector a a to be (h a b(T a a b )^ because 
the latter is not well-defined. (Since h ab has degenerate signature, it is not 
invertible, i.e., there does not exist a field h a b satisfying h ab hb c — 5 a c .) But 
if a a is spacelike, we can use h ab to assign a spatial length to it indirectly. 
It turns out that: (i) a vector a a is spacelike iff it can be expressed in the 
form a a = h ab Af,, and (ii) if it can be so expressed, the quantity (h ab \ a Af,) is 
independent of the choice of A a . Furthermore, the signature of h ab guarantees 
that (h ab \ a Xb) > 0. So if a a is spacelike, we can take its spatial length to be 
(h ab X a A b )5, for any choice of corresponding A a . 

One final preliminary remark about classical spacetimes is needed. It is 
crucial for our purposes, as will be clear, that the compatibility condition (|4Tp 
does not determine a unique derivative operator. (It is a fundamental result 
that the compatibility condition V a gbc = determines a unique derivative 
operator if g a b is a semi-Riemannian metric, i.e., a smooth, symmetric field that 
is invertible (i.e., non-degenerate). But neither t a b nor h ab is invertible.) 

Because h ab is not invertible, we cannot raise and lower indices with it. But 
we can, at least, raise indices with it, and it is sometimes convenient to do so. 

so as to be a geodesic (with respect to V a ) iff its image could be the worldline of a free particle; 
(iii) clocks record the passage of elapsed t a f, - length along their worldlines. (Here, in contrast 
the relativistic setting, we have only massive particles to consider and, until we geomctrizc 
Newtonian gravity, do not count a particle as "free" if it is subject to "gravitational force".) 

59 By the compatibility condition, t a V„ £ a = V n (t a £ a ) = £ n V n (1) = 0. 
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So, for example, if R a bcd is the Riemann curvature tensor field associated with 
V Q , we can understand R ab cd to be an abbreviation for h bn R a ncd - 

Let us now, finally, consider Newtonian gravitation theory. In the standard 
(non-geometrized) version, one works with a flat derivative operator V a and 
a gravitational potential <p, the latter understood to be a smooth, real-valued 
function on M. The gravitational force on a point particle with mass m is given 
by — m h ab Vb (j>. (Notice that this is a spacelike vector by the orthogonality 
condition.) Using our convention for raising indices, we can also express the 
vector as: -mV°f It follows that if the particle is subject to no forces except 
gravity, and if it has four- velocity £ a , it satisfies the equation of motion 

-V°0 = rV„f\ (49) 

(Here we have just used — raV° <f> for the left side of HEJ-) It is also assumed 
that (f> satisfies Poisson's equation: 

V Q V a = 47rp, (50) 

where p is the Newtonian mass-density function (another smooth real-valued 
function on M). (The expression on the left side is an abbreviation for: 

h ab v b v a 4>.) 

In the geometrized formulation of the theory, gravitation is no longer con- 
ceived as a fundamental "force" in the world, but rather as a manifestation of 
spacetime curvature (just as in relativity theory). Rather than thinking of point 
particles as being deflected from their natural straight (i.e., geodesic) trajecto- 
ries, one thinks of them as traversing geodesies in curved spacetime. So we have 
a geometry problem. Starting with the structure (M, t a b, h ab , V a ), can we find 

9 ... h 

a new derivative operator V a , also compatible with the metrics t a b and h a , 

such that a timelike curve satisfies the equation of motion l|49|l with respect to 

g 

the original derivative operator V a iff it is a geodesic with respect to V a ? The 

following proposition (essentially due to Trautman |1965p asserts that there is 

g 

exactly one such V Q . It also records several facts about the Riemann curvature 

g g 
tensor field R a bcd associated with V a . 

In formulating the proposition, we make use of the following basic fact about 

1 2 

derivative operators. Given any two such operators V a and V a on M , there is 
a unique smooth tensor field C a bc , symmetric in its covariant indices, such that, 
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for all smooth fields a a ' b c d on M, 
i 1 

n& c...d = V « a c.d + G nc Ct r. . .d + ■ ■ ■ + G nd a c...r 

— C a nr a r - cd - ... — C nr a a "- r c d . (51) 

2 1 

In this case, we say that "the action of V a relative to that of V a is given 

l 

by C a bc " . 60 Conversely, given any one derivative operator V a on M, and any 

2 

smooth, symmetric field C a bc on M, lj?T)l defines a new derivative operator V a 
on M. (See Wald pMl p. 33].) 

Proposition 3.2.1 (Geometrization Theorem). Let (M,t a b,h ab ,V a ) be a 

classical spacetime with V a flat (R a bcd = 0). Further, let <fi and p be smooth real 

valued functions on M satisfying Poisson's equation: V a V a (f> — 4 n p. Finally, 
g 

let V a be the derivative operator on M whose action relative to that of V a is 

given by C a bc = —tb c V a 0. Then all the following hold, 
g 

(Gl) (M,t a b, h ° , V a ) is a classical spacetime. 

g 

(G2) V a is the unique derivative operator on M such that, for all timelike 
curves on M with four-velocity fields £ a , 

c v„ r = o <=> -v a (f> = ev„f. (52) 

9 9 

(G3) The curvature field R a bcd associated with V a satisfies: 

g 

R bc = 4:Trpt bc (53) 
R a "cd = (54) 
R [a (b c] d) = 0. (55) 

(|53|l is the geometrized version of Poisson's equation. The proof proceeds by 
more-or-less straight forward computation using (|51|) . 61 

2 1 
60 Clearly, if the action of V a relative to that of V a is given by C a bc , then, conversely, the 
1 2 
action of Va relative to that of V a is given by —C a bc . In the sum on the right side of 1511 . 

there is one term involving C a bc for each index in a° d - In each case, the index in question 

is contracted with C a bc , and the term carries a coefficient of +1 or —1 depending on whether 

the index in question is in covariant (down) or contravariant (up) position. (The components 

of C a bc in a particular coordinate system are obtained by subtracting the Christoffel symbols 

1 2 
associated with V a (in that coordinate system) from those associated with V Q .) 

61 Here is a sketch. By 1511 . 

9 

V a t bc = Va t bc + C T ab t rc + C" ac t br = Va t bc + ( — t ab V <f>) t rc + { — tac V0) t br . 
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We can also work in the opposite direction. In geometrized Newtonian grav- 

9 

itation theory, one starts with a curved derivative operator V Q satisfying (|53|) . 

Q54)). iJSSJ, and with the principle that point particles subject to no forces (ex- 

g 

cept "gravity") traverse geodesies with respect to V Q . I|54|) and l|55|l function 
as integrability conditions that ensure the possibility of working backwards to 
recover the standard formulation in terms of a gravitational potential and flat 
derivative operator. 62 We have the following recovery, or de-geometrization, 
theorem (also essentially due to Trautman ^965 ). 

The first term in the far right sum vanishes by the compatibility condition 1471 : the second and 
third do so by the orthogonality condition 1461 since, for example, (V r 0) t br = (h rm t br ) V m </>. 

9 

So V a is compatible with t bc . Much the same argument shows that it is also compatible with 
h ab . This give us (Gl). 

9 

For (G2), let V a (temporarily) be an arbitrary derivative operator on M whose action 
relative to that V a is given by some field C a bc . Let p be an arbitrary point in M, and let £ a 
be the four- velocity field of an arbitrary timelike curve through p. Then, by 1511 . 

c v„ e = rv„ e - c a rn ec- 

It follows that V n will satisfy (G2) iff C a rn £ r £ n = -W a <f> or, equivalently, 

[C a rn +(v»ufr = o, 

for all future-directed unit timelike vectors at all points p. But the space of future-directed 
unit timelike vectors at any p spans the tangent space M p there, and the field in brackets is 

9 

symmetric in its covariant indices. So, Vn will satisfy (G2) iff C a rn = — (V a </>) t rn everywhere. 

9 

Finally, for (G3) we use the fact that R a bcd can be expressed as a sum of terms involving 
R a bcd and C a bc (see Wald [T5g4l p. 184]), and then substitute for C a bc : 

g 

R bed = R bed + 2 V[ c C d ](, + 2 C b y c C d j n 

= R\cd ~ 2t 6 [ d V c] V> = -2t b[d V c] w- 

(Here C r b yJJ a d ^ n turns out to be by the orthogonality condition, and V[ c C a d ^ b turns out to 
be — tb[dS c] V a </> by the compatibility condition. For the final equality we use our assumption 
that R a bcd = 0.) l54l and 1551 now follow from the orthogonality condition and (for 1551 ) 
from the fact that V[ c V a '</> = for any smooth function cj>. Contraction on 'a' and l d' yields 

Rbc = t bc {W a V a <p). 

So 1531 follows from our assumption that V a Va <f> = 4 7rp (and the fact that V« V°(/i = 
V a V a <f>). 

62 I am deliberately passing over some subtleties here. Geometrized Newtonian gravitation 
theory comes in several variant formulations. (See Bain 2004 for a careful review of the 
differences.) The one presented here is essentially that of Trautman 1965 . In other weaker 
formulations (such as that in Kiinzle 119721 '). condition 1541 is dropped, and it is not possible 
to fully work back to the standard formulation (in terms of a gravitational potential and fiat 
derivative operator) unless special global conditions on spacetime structure are satisfied. 
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Proposition 3.2.2 (Recovery Theorem). Let (M,t a i,,h ab ,V a ) be a classical 
spacetime that, together with a smooth, real-valued function p on M , satisfies 
conditions |55J) . \54\j , \55)) . Then, at least locally (and globally if M is, for 
example, simply connected), there exists a smooth, real-valued function <f> on M 
and a flat derivative operator V a such that all the following hold. 
(Rl) (M,t a b,h ab ,^/ a ) is a classical spacetime. 

(R2) For all timelike curves on M with four-velocity fields £ a , the geometri- 

zation condition is satisfied. 
(R3) V a satisfies Poisson's equation: V a V a <p = Air p. 

g 

The theorem is an existential assertion of this form: given V a satisfying 

certain conditions, there exists (at least locally) a smooth function <f> on M and 

9 

a flat derivative operator V a such that V a arises as the "geometrization" of the 
pair (V a ,0). But, as claimed at the beginning of this section, we do not have 
uniqueness unless special boundary conditions are imposed on <p. 

For suppose V a is flat, and the pair (V a ,(/>) satisfies (Rl), (R2), (R3). Let 
tp be any smooth function (with the same domain as 4>) such that V a V b tp 
vanishes everywhere, but V b ifj does not. 63 If we set (ft = <f> + ip, and take V a 

9 

to be the derivative operator relative to which the action of V a is given by 
C a bc = ~tb c V a (j>, then V a is flat and the pair (V a ,0) satisfies conditions (Rl), 
(R2), (R3) as well. 64 

63 We can think of \7 b tp as the "spatial gradient" of ip. The stated conditions impose the 
requirement that SJ b ip be constant on all spacclikc submanifolds ("time slices"), but not vanish 
on all of them. 

64 It follows directly from the way V a was defined that the pair (V a , <f>) satisfies conditions 
(Rl) and (R2). (The argument is almost exactly the same as that used in an earlier note 
to prove (Gl) and (G2) in the Geometrization Theorem.) What must be shown that is that 
V a is flat, and that the pair (Va,</>) satisfies Poisson's equation: V a Va 4> = 47rp. We do 
so by showing that (i) ~R a bcd = R a bcd , (ii) V a V a ip = 0, and (iii) V a V a a = V a V a a, for 
all smooth scalar fields a on M. (It follow immediately from (ii) and (iii) that V a V a <j> = 

V a Va <i> + V a Va V = V a Va</-+ V a VaV =47rp.) 

We know from the uniqueness clause of (G2) in the Geometrization Theorem that the action 

9 

of V a with respect to V B is given by the field C a bc = — t bc V a </>. It follows that the action of 
Va relative to that of V tt is given by C\ c = -C a bc + C a bc = -t bc V a (-0 + </>) = h c V a ip. 
So, arguing almost exactly as we did in the proof of (G3) in the Geometrization Theorem, we 
have 

R\cd = R%od + 2t M aV c] V>. (56) 
Now it follows from V a V 6 V = that 

V c V a V = t c (rV„V a ^), (57) 
where t ab = t a t b , and £ n is any smooth future-directed unit timelike vector field on M. 
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(V a ,0) 

/ 

(V a ,0) 

But, because V h ip is non-vanishing (somewhere or other), the pairs (V Q , (j>) 

_ 9 

and (V a ,<£) are distinct decompositions of V a . Relative to the first, a point 

particle (with mass to and four-velocity £ a ) has acceleration £ n V„ £ a and is 
subject to a gravitational force — to V a (p. Relative to the second, it has ac- 
celeration £™ V r £ a = £™ V„ £ a — V a tp and is subject to a gravitational force 
-to V a ~cp = -to V a </> ~mV a ip. 

As suggested at the beginning of the section, we can take this non uniqueness 
of recovery result to capture in precise mathematical language the standard 
claim that Newtonian gravitational force is a gauge quantity. By the argument 
just given, if we can take the force on a point particle with mass to to be 
— to V a 0, we can equally well take it to be — m V a (</> + ip), where ip is any field 
satisfying V a V b ip = 0. 



3.3 Recovering Global Geometric Structure from "Causal 
Structure" 

There are many interesting and important issues concerning the global struc- 
ture of relativistic spacetimes that might be considered here - the nature and 
significance of singularities, the cosmic censorship hypothesis, the possibility of 
"time travel" , and others. 65 But we limit ourselves to a few remarks about one 
rather special topic. 

In our discussion of relativistic spacetime structure, we started with geomet- 
ric models (M, g a b) exhibiting several levels of geometric structure, and used 

Hence, t 6 [ d V c j V a V> = t b t[dfc](£™V„V a V) = 0. This, together with gives us (i). And 

(ii) follows directly from 1571 . Finally, for (iii), notice that 

V a V a a = h ar V r V a a = h ar V r S/aa = h ar (VrV a a + C n ra \7„a) 
= V°V„« + h ar t ra (V n V)(V„ a) = V a V„ a. 
The final equality follows from the orthogonality condition. 

65 Earman 1995 offers a comprehensive review of many of them. (On the topic of singular- 
ities, I can also recommend Curiel 1999 .) 
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the latter to define the (two-place) relations -C and < on M. 66 The latter 
are naturally construed as relations of "causal connectibility (or accessibility)" . 
The question arises whether it is possible to work backwards, i.e., start with 
the pair (M, <C) or (M, <), with M now construed as a bare point set, and 
recover the geometric structure with which we began. The question is sug- 
gested by long standing interest on the part of some philosophers in "causal 
theories" of time or spacetime. It also figures centrally in a certain approach to 
quantum gravity developed by Rafael Sorkin and co-workers. (See, e.g., Sorkin 
P^|iortfTcoming| .) 

Here is one way to make the question precise. (For convenience, we work 
with the relation <C.) 

Let (M, g ab ) and (M, g ab ) be (temporally oriented) relativistic spacetimes. 
We say that a bijection <f> : M — ► M between their underlying point sets is a 
causal isomorphism if, for all p and q in M, 

p<^q <f>{p) < <P{q). (58) 

Now we ask: Does a causal isomorphism have to be a homeomorphism? a 
diffcomorphism? a conformal isometry? 67 

Without further restrictions on (M, g ab ) and (M,g ab ), the answer is certainly 
'no' to all three questions. Unless the "causal structure" (i.e., the structure de- 
termined by -C) of a spacetime is reasonably well behaved, it provides no useful 
information at all. For example, let us say that a spacetime is causally de- 
generate if p <C q for all points p and q. Any bijection between two causally 
degenerate spacetimes qualifies as a causal isomorphism. But we can certainly 
find causally degenerate spacetimes whose underlying manifolds have different 
topologies (e.g., Godel spacetime and a rolled- up version of Minkowski space- 
time). 

There is a hierarchy of "causality conditions" that is relevant here. (See, 
e.g., Hawking and Ellis [19721 section 6.4].) They impose, with varying degrees 
of stringency, the requirement that there exist no closed, or "almost closed", 
timelike curves. Here are three. 

66 Recall that p <C q holds if there is a future-directed timelike curve that runs from p to q; 
and p < q holds if there is a future-directed causal curve that runs from p to q. 

67 We know in advance that a causal isomorphism need not be a (full) isometry because 
conformally equivalent metrics g ab and f2 2 g ab on a manifold M determine the same relation 
■C. The best one can ask for is that it be a conformal isometry, i.e., that it be a diffeomorphism 
that preserves the metric up to a conformal factor. 
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chronology: There do not exist closed timelike curves. (Equivalently, for all 
p, it is not the case that p <C p.) 

future (resp. past) distinguishability: For all points p, and all sufficiently small 
open sets O containing p, no future directed (resp. past directed) timelikc 
curve that starts at p, and leaves O, ever returns to O. 

strong causality: For all points p, and all sufficiently small open sets O con- 
taining p, no future directed timelike curve that starts in O, and leaves O, 
ever returns to O. 

It is clear that strong causality implies both future distinguishability and past 
distinguishability, and that each of the distinguishability conditions (alone) im- 
plies chronology. Standard examples (Hawking and Ellis |1972j ) establish that 
the converse implications do not hold, and that neither distinguishability con- 
dition implies the other. 

The names "future distinguishability" and "past distinguishability" are easily 
explained. For any p, let I + {p) be the set {9 : p « 9} and let I~(p) be the 
set {q : q <C p}. Then future distinguishability is equivalent to the requirement 
that, for all p and q, 

I + (p) = I+(q) ^p = q. 

And the counterpart requirement with / + replaced by I~ is equivalent to past 
distinguishability. 

We mention all this because it turns out that one gets a positive answer to 
all three questions above if one restricts attention to spacetimes that are both 
future and past distinguishing. 

Proposition 3.3.1. Let (M,g a b) and (M,g ab ) be (temporally oriented) rela- 
tivistic spacetimes that are past and future distinguishing, and let <f> : M — > M 
be a causal isomorphism. Then <j> is a diffeomorphism and preserves g a b up to 
a conformal factor, i.e., 4>+g a b is conformally equivalent tog ab . 

A proof is given in Malament |1977j . A counterexample given there also 
shows that the proposition fails if the initial restriction on causal structure is 
weakened to past distinguishability or to future distinguishability alone. 
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